Abstract

The capacity region of the two-user Gaussian Interference Channel (IC) is studied. Three classes of channels are considered:
weak, one-sided, and mixed Gaussian IC. For the weak Gaussian IC, a new outer bound on the capacity region is obtained that
outperforms previously known outer bounds. The sum capacity for a certain range of channel parameters is derived. For this
range, it is proved that using Gaussian codebooks and treating interference as noise is optimal. It is shown that when Gaussian
codebooks are used, the full Han-Kobayashi achievable rate region can be obtained by using the naive Han-Kobayashi achievable
scheme over three frequency bands (equivalently, three subspaces). For the one-sided Gaussian IC, an alternative proof for
Sato's outer bound is presented. We derive the full Han-Kobayashi achievable rate region when Gaussian codebooks are utilized.
For the mixed Gaussian IC, a new outer bound is obtained that outperforms previously known outer bounds. For this case, the
sum capacity for the entire range of channel parameters is derived. It is proved that the full Han-Kobayashi achievable rate region
using Gaussian codebooks is equivalent to that of the one-sided Gaussian IC for a particular range of channel parameters.

Index Terms 

Gaussian interference channels, capacity region, sum capacity, convex regions. 

I. Introduction

One of the fundamental problems in Information Theory, originating from [1], is the full characterization of the capacity
region of the interference channel (IC). The simplest form of IC is the two-user case in which two transmitters aim to
convey independent messages to their corresponding receivers through a common channel. Except for some special cases, such
as very strong and strong interference, where the exact capacity region has been derived [2], [3], the characterization of the
capacity region for the general case is still an open problem.

A limiting expression for the capacity region is obtained in [4] (see also [5]). Unfortunately, due to excessive computational
complexity, this type of expression does not result in a tractable approach to fully characterize the capacity region. To show
the weakness of the limiting expression, Cheng and Verdú have shown that for the Gaussian Multiple Access Channel (MAC),
which can be considered as a special case of the Gaussian IC, the limiting expression fails to fully characterize the capacity
region by relying only on Gaussian distributions [6]. However, there is a point on the boundary of the capacity region of
the MAC that can be obtained directly from the limiting expression. This point is achievable by using the simple scheme of
Frequency/Time Division (FD/TD).

The computational complexity inherent to the limiting expression is due to the fact that the corresponding encoding and
decoding strategies are of the simplest possible form. The encoding strategy is based on mapping data to a codebook constructed
from a unique probability density and the decoding strategy is to treat the interference as noise. In contrast, using more
sophisticated encoders and decoders may result in collapsing the limiting expression into a single letter formula for the
capacity region. As evidence, it is known that the jointly typical decoder for the MAC achieves the capacity region [7].
Moreover, there are some special cases, such as the strong IC, where the exact characterization of the capacity region has been
derived [2], [3]; decoding the interference is the key idea behind this result.

In their pioneering work, Han and Kobayashi (HK) proposed a coding strategy in which the receivers are allowed to decode 
part of the interference as well as their own data [8]. The HK achievable region is still the best inner bound for the capacity 
region. Specifically, in their scheme, the message of each user is split into two independent parts: the common part and the 
private part. The common part is encoded such that both users can decode it. The private part, on the other hand, can be 
decoded only by the intended receiver and the other receiver treats it as noise. In summary, the HK achievable region is the 
intersection of the capacity regions of two three-user MACs, projected on a two-dimensional subspace. 

The HK scheme can be directly applied to the Gaussian IC. Nonetheless, there are two sources of difficulties in characterizing 
the full HK achievable rate region. First, the optimal distributions are unknown. Second, even if we confine the distributions to 
be Gaussian, computation of the full HK region under Gaussian distribution is still difficult due to numerous degrees of freedom 
involved in the problem. The main reason behind this complexity is the computation of the cardinality of the time-sharing
parameter. Recently, in [9], Chong et al. have presented a simpler expression with fewer inequalities for the HK achievable region.
Since the cardinality of the time-sharing parameter is directly related to the number of inequalities appearing in the achievable
rate region, the computational complexity is decreased. However, finding the full HK achievable region is still prohibitively
complex.

¹An earlier version of this work containing all the results is reported in Library and Archives Canada Technical Report UW-ECE 2007-26, Aug.
2007 (see http://www.cst.uwaterloo.ca/pub-tech_rep.html for details).

Regarding outer bounds on the capacity region, there are three main results known. The first one, obtained by Sato [10],
was originally derived for the degraded Gaussian IC. Sato has shown that the capacity region of the degraded Gaussian IC is
outer bounded by a certain degraded broadcast channel whose capacity region is fully characterized. In [11], Costa has proved
that the capacity region of the degraded Gaussian IC is equivalent to that of the one-sided weak Gaussian IC.
Hence, the Sato outer bound can be used for the one-sided Gaussian IC as well.

The second outer bound obtained for the weak Gaussian IC is due to Kramer [12]. The Kramer outer bound is based on the
fact that removing one of the interfering links enlarges the capacity region. Therefore, the capacity region of the two-user
Gaussian IC is inside the intersection of the capacity regions of the underlying one-sided Gaussian ICs. For the case of the weak
Gaussian IC, the underlying one-sided IC is weak, for which the capacity region is unknown. However, Kramer has used the
outer bound obtained by Sato to derive an outer bound for the weak Gaussian IC.

The third outer bound, due to Etkin, Tse, and Wang (ETW), is based on the genie-aided technique [13]. A genie that provides
some extra information to the receivers can only enlarge the capacity region. At first glance, it seems a clever genie must
provide some information about the interference to the receiver to help in decoding the signal by removing the interference. In
contrast, the genie in the ETW scheme provides information about the intended signal to the receiver. Remarkably, reference
[13] shows that their proposed outer bound outperforms the Kramer bound for a certain range of parameters. Moreover, using a
similar method, [13] presents an outer bound for the mixed Gaussian IC.

In this paper, by introducing the notion of admissible ICs, we propose a new outer bounding technique for the two-user 
Gaussian IC. The proposed technique relies on an extremal inequality recently proved by Liu and Viswanath [14]. We show 
that by using this scheme, one can obtain tighter outer bounds for both weak and mixed Gaussian ICs. More importantly, the 
sum capacity of the weak Gaussian IC for a certain range of the channel parameters is derived.

The rest of this paper is organized as follows. In Section II, we present some basic definitions and review the HK achievable
region when Gaussian codebooks are used. We study the time-sharing and the concavification methods as means to enlarge the
basic HK achievable region. We investigate conditions for which the two regions obtained from time-sharing and concavification
coincide. Finally, we consider an optimization problem based on the extremal inequality and compute its optimal solution.

In Section III, the notion of an admissible IC is introduced. Some classes of admissible ICs for the two-user Gaussian case are
studied and outer bounds on the capacity regions of these classes are computed. We also obtain the sum capacity of a specific
class of admissible ICs where it is shown that using Gaussian codebooks and treating interference as noise is optimal.

In Section IV, we study the capacity region of the weak Gaussian IC. We first derive the sum capacity of this channel for 
a certain range of parameters where it is proved that users should treat the interference as noise and transmit at their highest 
possible rates. We then derive an outer bound on the capacity region which outperforms the known results. We finally prove 
that the basic HK achievable region results in the same enlarged region by using either time-sharing or concavification. This 
reduces the complexity of the characterization of the full HK achievable region when Gaussian codebooks are used. 

In Section V, we study the capacity region of the one-sided Gaussian IC. We present a new proof for the Sato outer bound 
using the extremal inequality. Then, we present methods to simplify the HK achievable region such that the full region can be 
characterized. 

In Section VI, we study the capacity region of the mixed Gaussian IC. We first obtain the sum capacity of this channel 
and then derive an outer bound which outperforms other known results. Then, by investigating the HK achievable region for
different cases, we prove that for a certain range of channel parameters, the full HK achievable rate region using Gaussian
codebooks is equivalent to that of the one-sided IC. Finally, in Section VII, we conclude the paper.

A. Notations 

Throughout this paper, we use the following notations. Vectors are represented by bold faced letters. Random variables,
matrices, and sets are denoted by capital letters where the difference is clear from the context. $|A|$, $\mathrm{tr}\{A\}$, and $A^t$ represent
the determinant, trace, and transpose of the square matrix $A$, respectively. $I$ denotes the identity matrix. $\mathbb{N}$ and $\Re$ are the sets
of nonnegative integers and real numbers, respectively. The union, intersection, and Minkowski sum of two sets $U$ and $V$ are
represented by $U \cup V$, $U \cap V$, and $U + V$, respectively. We use $\gamma(x)$ as an abbreviation for the function $0.5 \log_2(1 + x)$.

II. Preliminaries 

A. The Two-user Interference Channel 

Definition 1 (two-user IC): A two-user discrete memoryless IC consists of two finite sets $\mathscr{X}_1$ and $\mathscr{X}_2$ as input alphabets
and two finite sets $\mathscr{Y}_1$ and $\mathscr{Y}_2$ as the corresponding output alphabets. The channel is governed by conditional probability
distributions $\omega(y_1, y_2 | x_1, x_2)$ where $(x_1, x_2) \in \mathscr{X}_1 \times \mathscr{X}_2$ and $(y_1, y_2) \in \mathscr{Y}_1 \times \mathscr{Y}_2$.

Fig. 1. Classes of the two-user ICs.

Definition 2 (capacity region of the two-user IC): A code $(2^{nR_1}, 2^{nR_2}, n, \lambda_1^n, \lambda_2^n)$ for the two-user IC consists of the
following components for User $i \in \{1, 2\}$:

1) A uniformly distributed message set $M_i \in [1, 2, \ldots, 2^{nR_i}]$.

2) A codebook $\mathcal{X}_i = \{\mathbf{x}_i(1), \mathbf{x}_i(2), \ldots, \mathbf{x}_i(2^{nR_i})\}$ where $\mathbf{x}_i(\cdot) \in \mathscr{X}_i^n$.

3) An encoding function $F_i : [1, 2, \ldots, 2^{nR_i}] \to \mathcal{X}_i$.

4) A decoding function $G_i : \mathbf{y}_i \to [1, 2, \ldots, 2^{nR_i}]$.

5) The average probability of error $\lambda_i^n = P(G_i(\mathbf{y}_i) \neq M_i)$.

A rate pair $(R_1, R_2)$ is achievable if there is a sequence of codes $(2^{nR_1}, 2^{nR_2}, n, \lambda_1^n, \lambda_2^n)$ with vanishing average error
probabilities. The capacity region of the IC is defined to be the closure of the set of achievable rate pairs.
Let $\mathscr{C}_{IC}$ denote the capacity region of the two-user IC. The limiting expression for $\mathscr{C}_{IC}$ can be stated as [5]

\[
\mathscr{C}_{IC} = \lim_{n \to \infty} \mathrm{closure} \left( \bigcup_{p(x_1^n) p(x_2^n)} \left\{ (R_1, R_2) \,\middle|\, R_1 \le \tfrac{1}{n} I(x_1^n; y_1^n),\ R_2 \le \tfrac{1}{n} I(x_2^n; y_2^n) \right\} \right). \tag{1}
\]

In this paper, we focus on the two-user Gaussian IC which can be represented in standard form as [15], [16]

\[
\begin{aligned}
y_1 &= x_1 + \sqrt{a}\, x_2 + z_1, \\
y_2 &= \sqrt{b}\, x_1 + x_2 + z_2,
\end{aligned} \tag{2}
\]

where $x_i$ and $y_i$ denote the input and output alphabets of User $i \in \{1, 2\}$, respectively, and $z_1 \sim \mathcal{N}(0, 1)$, $z_2 \sim \mathcal{N}(0, 1)$ are
standard Gaussian random variables. Constants $a \ge 0$ and $b \ge 0$ represent the gains of the interference links. Furthermore,
Transmitter $i$, $i \in \{1, 2\}$, is subject to the power constraint $P_i$. Achievable rates and the capacity region of the Gaussian IC can
be defined in a similar fashion as that of the general IC with the condition that the codewords must satisfy their corresponding
power constraints. The capacity region of the two-user Gaussian IC is denoted by $\mathscr{C}$. Clearly, $\mathscr{C}$ is a function of the parameters
$P_1$, $P_2$, $a$, and $b$. To emphasize this relationship, we may write $\mathscr{C}$ as $\mathscr{C}(P_1, P_2, a, b)$ as needed.

Remark 1: Since the capacity region of the general IC depends only on the marginal distributions [16], the ICs can be
classified into equivalence classes in which channels within a class have the same capacity region. In particular, for the Gaussian
IC given in (2), any choice of joint distributions for the pair $(z_1, z_2)$ does not affect the capacity region as long as the marginal
distributions remain Gaussian with zero mean and unit variance.

Depending on the values of $a$ and $b$, the two-user Gaussian IC is classified into weak, strong, mixed, one-sided, and
degraded Gaussian IC. In Figure 1, regions in the $ab$-plane together with their associated names are shown. Briefly, if $0 < a < 1$
and $0 < b < 1$, then the channel is called weak Gaussian IC. If $1 \le a$ and $1 \le b$, then the channel is called strong Gaussian
IC. If either $a = 0$ or $b = 0$, the channel is called one-sided Gaussian IC. If $ab = 1$, then the channel is called degraded
Gaussian IC. If either $0 < a < 1$ and $1 \le b$, or $0 < b < 1$ and $1 \le a$, then the channel is called mixed Gaussian IC. Finally,
the symmetric Gaussian IC (used throughout the paper for illustration purposes) corresponds to $a = b$ and $P_1 = P_2$.

Among all classes shown in Figure 1, the capacity region of the strong Gaussian IC is fully characterized [3], [2]. In this
case, the capacity region can be stated as the collection of all rate pairs $(R_1, R_2)$ satisfying

\[
\begin{aligned}
R_1 &\le \gamma(P_1), \\
R_2 &\le \gamma(P_2), \\
R_1 + R_2 &\le \min \{ \gamma(P_1 + a P_2),\ \gamma(b P_1 + P_2) \}.
\end{aligned}
\]
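As a small numerical illustration of the strong-interference characterization above, the following sketch evaluates $\gamma(x) = 0.5 \log_2(1 + x)$ and tests whether a rate pair satisfies the three inequalities. The function names and the example parameters are illustrative choices and do not appear in the paper.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x), in bits per real channel use."""
    return 0.5 * math.log2(1.0 + x)


def in_strong_ic_region(r1, r2, p1, p2, a, b, tol=1e-12):
    """Check the three strong-interference inequalities for the rate pair (r1, r2).

    Assumes a >= 1 and b >= 1; outside that range the inequalities are not
    claimed to describe the capacity region.
    """
    if a < 1 or b < 1:
        raise ValueError("strong interference requires a >= 1 and b >= 1")
    sum_bound = min(gamma(p1 + a * p2), gamma(b * p1 + p2))
    return (
        0 <= r1 <= gamma(p1) + tol
        and 0 <= r2 <= gamma(p2) + tol
        and r1 + r2 <= sum_bound + tol
    )


# Illustrative symmetric example: P1 = P2 = 3, a = b = 2.
inside = in_strong_ic_region(1.0, 0.6, 3.0, 3.0, 2.0, 2.0)
outside = in_strong_ic_region(1.0, 1.0, 3.0, 3.0, 2.0, 2.0)
```

In this example both individual bounds equal $\gamma(3) = 1$ bit, so the pair $(1, 1)$ is excluded only by the sum-rate inequality.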
B. Support Functions

Throughout this paper, we use the following facts from convex analysis. There is a one to one correspondence between any
closed convex set and its support function [17]. The support function of any set $D \subseteq \Re^m$ is a function $\sigma_D : \Re^m \to \Re$ defined
as

\[
\sigma_D(\mathbf{c}) = \sup \{ \mathbf{c}^t \mathbf{R} \mid \mathbf{R} \in D \}. \tag{3}
\]

Clearly, if the set $D$ is compact, then the sup is attained and can be replaced by max. In this case, the solutions of (3)
correspond to the boundary points of $D$ [17]. The following relation is the dual of (3) and holds when $D$ is closed and convex

\[
D = \{ \mathbf{R} \mid \mathbf{c}^t \mathbf{R} \le \sigma_D(\mathbf{c}),\ \forall\, \mathbf{c} \}. \tag{4}
\]

For any two closed convex sets $D$ and $D'$, $D \subseteq D'$ if and only if $\sigma_D \le \sigma_{D'}$.
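The support-function facts above can be tried out on polytopes given by their extreme points, for which the supremum in (3) is a maximum over finitely many points. The sketch below is only an illustration: the helper names and the two triangles are hypothetical, and comparing support functions on a finite set of directions gives a necessary check for inclusion, not a proof.

```python
import math


def support(points, c):
    """Support function of conv(points) in direction c: max over points of c^t r."""
    return max(sum(ci * ri for ci, ri in zip(c, r)) for r in points)


def passes_inclusion_check(points_d, points_d_prime, directions, tol=1e-12):
    """Return True if sigma_D(c) <= sigma_D'(c) on every sampled direction c.

    With finitely many directions this can only refute D being inside D';
    it does not certify the inclusion.
    """
    return all(
        support(points_d, c) <= support(points_d_prime, c) + tol
        for c in directions
    )


# Directions sampled on the unit circle, one per degree.
directions = [
    (math.cos(2.0 * math.pi * k / 360.0), math.sin(2.0 * math.pi * k / 360.0))
    for k in range(360)
]

small = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]   # triangle D
large = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0)]   # triangle D' containing D

small_in_large = passes_inclusion_check(small, large, directions)
large_in_small = passes_inclusion_check(large, small, directions)
```

Here the direction $(1, 0)$ already separates the two sets in the reverse direction, since $\sigma_{D'} = 2 > 1 = \sigma_D$ there.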



C. Han-Kobayashi Achievable Region 

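The best inner bound for the two-user Gaussian IC is the full HK achievable region denoted by $\mathscr{C}_{HK}$ [8]. Despite having
a single letter formula, $\mathscr{C}_{HK}$ is not fully characterized yet. In fact, finding the optimum distributions achieving boundary
points of $\mathscr{C}_{HK}$ is still an open problem. We define $\mathscr{G}$ as a subset of $\mathscr{C}_{HK}$ where Gaussian distributions are used for codebook
generation. Using a shorter description of $\mathscr{C}_{HK}$ obtained in [9], $\mathscr{G}$ can be described as follows.

Let us first define $\mathscr{G}_0$ as the collection of all rate pairs $(R_1, R_2) \in \Re_+^2$ satisfying

\[
R_1 \le \psi_1 = \gamma\left( \frac{P_1}{1 + a \beta P_2} \right), \tag{5}
\]
\[
R_2 \le \psi_2 = \gamma\left( \frac{P_2}{1 + b \alpha P_1} \right), \tag{6}
\]
\[
R_1 + R_2 \le \psi_3 = \min \{ \psi_{31}, \psi_{32}, \psi_{33} \}, \tag{7}
\]
\[
2R_1 + R_2 \le \psi_4 = \gamma\left( \frac{P_1 + a(1-\beta)P_2}{1 + a \beta P_2} \right) + \gamma\left( \frac{\alpha P_1}{1 + a \beta P_2} \right) + \gamma\left( \frac{\beta P_2 + b(1-\alpha)P_1}{1 + b \alpha P_1} \right), \tag{8}
\]
\[
R_1 + 2R_2 \le \psi_5 = \gamma\left( \frac{\beta P_2}{1 + b \alpha P_1} \right) + \gamma\left( \frac{P_2 + b(1-\alpha)P_1}{1 + b \alpha P_1} \right) + \gamma\left( \frac{\alpha P_1 + a(1-\beta)P_2}{1 + a \beta P_2} \right), \tag{9}
\]

for fixed $\alpha \in [0, 1]$ and $\beta \in [0, 1]$.² $\psi_3$ is the minimum of $\psi_{31}$, $\psi_{32}$, and $\psi_{33}$ defined as

\[
\psi_{31} = \gamma\left( \frac{P_1 + a(1-\beta)P_2}{1 + a \beta P_2} \right) + \gamma\left( \frac{\beta P_2}{1 + b \alpha P_1} \right), \tag{10}
\]
\[
\psi_{32} = \gamma\left( \frac{\alpha P_1}{1 + a \beta P_2} \right) + \gamma\left( \frac{P_2 + b(1-\alpha)P_1}{1 + b \alpha P_1} \right), \tag{11}
\]
\[
\psi_{33} = \gamma\left( \frac{\alpha P_1 + a(1-\beta)P_2}{1 + a \beta P_2} \right) + \gamma\left( \frac{\beta P_2 + b(1-\alpha)P_1}{1 + b \alpha P_1} \right). \tag{12}
\]

$\mathscr{G}_0$ is a polytope and a function of four variables $P_1$, $P_2$, $\alpha$, and $\beta$. To emphasize this relation, we may write $\mathscr{G}_0(P_1, P_2, \alpha, \beta)$
as needed. It is convenient to represent $\mathscr{G}_0$ in a matrix form as $\mathscr{G}_0 = \{ \mathbf{R} \mid A\mathbf{R} \le \boldsymbol{\Psi}(P_1, P_2, \alpha, \beta) \}$ where $\mathbf{R} = (R_1, R_2)^t$,
$\boldsymbol{\Psi} = (\psi_1, \psi_2, \psi_3, \psi_4, \psi_5)^t$, and

\[
A = \begin{pmatrix} 1 & 0 & 1 & 2 & 1 \\ 0 & 1 & 1 & 1 & 2 \end{pmatrix}^t.
\]

Equivalently, $\mathscr{G}_0$ can be represented as the convex hull of its extreme points, i.e., $\mathscr{G}_0(P_1, P_2, \alpha, \beta) = \mathrm{conv}\{ \mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_K \}$,
where it is assumed that $\mathscr{G}_0$ has $K$ extreme points. It is easy to show that $K \le 7$.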
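The bounds (5)-(12) are straightforward to evaluate for fixed power splits. The sketch below is a hedged illustration: the names `hk_bounds` and `in_g0` are hypothetical, $\alpha$ and $\beta$ are taken to be the fractions of power given to the private messages (as in the footnote on the HK scheme), and the numerical parameters are arbitrary.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def hk_bounds(p1, p2, a, b, alpha, beta):
    """Return (psi1, psi2, psi3, psi4, psi5) of (5)-(12) for fixed alpha, beta."""
    n1 = 1.0 + a * beta * p2    # noise plus private interference at receiver 1
    n2 = 1.0 + b * alpha * p1   # noise plus private interference at receiver 2
    psi1 = gamma(p1 / n1)
    psi2 = gamma(p2 / n2)
    psi31 = gamma((p1 + a * (1 - beta) * p2) / n1) + gamma(beta * p2 / n2)
    psi32 = gamma(alpha * p1 / n1) + gamma((p2 + b * (1 - alpha) * p1) / n2)
    psi33 = (gamma((alpha * p1 + a * (1 - beta) * p2) / n1)
             + gamma((beta * p2 + b * (1 - alpha) * p1) / n2))
    psi3 = min(psi31, psi32, psi33)
    psi4 = (gamma((p1 + a * (1 - beta) * p2) / n1)
            + gamma(alpha * p1 / n1)
            + gamma((beta * p2 + b * (1 - alpha) * p1) / n2))
    psi5 = (gamma(beta * p2 / n2)
            + gamma((p2 + b * (1 - alpha) * p1) / n2)
            + gamma((alpha * p1 + a * (1 - beta) * p2) / n1))
    return psi1, psi2, psi3, psi4, psi5


# Rows of the matrix A in the description A R <= Psi.
A = [(1, 0), (0, 1), (1, 1), (2, 1), (1, 2)]


def in_g0(r, psi, tol=1e-12):
    """Check whether the rate pair r lies in the polytope {R >= 0 : A R <= Psi}."""
    return all(x >= 0 for x in r) and all(
        row[0] * r[0] + row[1] * r[1] <= bound + tol for row, bound in zip(A, psi)
    )


# With alpha = beta = 1 all power is private: interference is treated as noise.
psi = hk_bounds(10.0, 10.0, 0.2, 0.2, 1.0, 1.0)
corner = (psi[0], psi[1])
```

For $\alpha = \beta = 1$ the three sum-rate terms coincide with $\psi_1 + \psi_2$, so the polytope reduces to the rectangle obtained by treating interference as noise.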

Now, $\mathscr{G}$ can be defined as a region obtained from enlarging $\mathscr{G}_0$ by making use of the time-sharing parameter, i.e., $\mathscr{G}$ is the
collection of all rate pairs $\mathbf{R} = (R_1, R_2)^t$ satisfying

\[
A\mathbf{R} \le \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(P_{1i}, P_{2i}, \alpha_i, \beta_i), \tag{13}
\]

where $q \in \mathbb{N}$ and

\[
\sum_{i=1}^{q} \lambda_i P_{1i} \le P_1, \tag{14}
\]
\[
\sum_{i=1}^{q} \lambda_i P_{2i} \le P_2, \tag{15}
\]
\[
\sum_{i=1}^{q} \lambda_i = 1, \tag{16}
\]
\[
\lambda_i \ge 0,\ (\alpha_i, \beta_i) \in [0, 1]^2;\ \forall\, i \in \{1, \ldots, q\}. \tag{17}
\]

²In the HK scheme, two independent messages are encoded at each transmitter, namely the common message and the private message. $\alpha$ and $\beta$ are the
parameters that determine the amount of power allocated to the common and private messages for the two users, i.e., $\alpha P_1$, $\beta P_2$ and $(1 - \alpha)P_1$, $(1 - \beta)P_2$
of the total power is used for the transmission of the private/common messages to the first/second users, respectively.

It is easy to show that $\mathscr{G}$ is a closed, bounded and convex region. In fact, the capacity region $\mathscr{C}$ which contains $\mathscr{G}$ is inside
the rectangle defined by inequalities $R_1 \le \gamma(P_1)$ and $R_2 \le \gamma(P_2)$. Moreover, $(0, 0)$, $(\gamma(P_1), 0)$, and $(0, \gamma(P_2))$ are extreme
points of both $\mathscr{C}$ and $\mathscr{G}$. Hence, to characterize $\mathscr{G}$, we need to obtain all extreme points of $\mathscr{G}$ that are in the interior of the
first quadrant (the same argument holds for $\mathscr{C}$). In other words, we need to obtain $\sigma_{\mathscr{G}}(c_1, c_2)$, the support function of $\mathscr{G}$, either
when $1 \le c_1$ and $c_2 = 1$ or when $c_1 = 1$ and $1 \le c_2$.

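We also define $\mathscr{G}_1$ and $\mathscr{G}_2$ obtained by enlarging $\mathscr{G}_0$ in two different manners. $\mathscr{G}_1$ is defined as

\[
\mathscr{G}_1(P_1, P_2) = \bigcup_{(\alpha, \beta) \in [0, 1]^2} \mathscr{G}_0(P_1, P_2, \alpha, \beta). \tag{18}
\]

$\mathscr{G}_1$ is not necessarily a convex region. Hence, it can be further enlarged by the convex hull operation. $\mathscr{G}_2$ is defined as the
collection of all rate pairs $\mathbf{R} = (R_1, R_2)^t$ satisfying

\[
\mathbf{R} = \sum_{i=1}^{q'} \lambda_i \mathbf{R}_i \tag{19}
\]

where $q' \in \mathbb{N}$ and

\[
A\mathbf{R}_i \le \boldsymbol{\Psi}(P_{1i}, P_{2i}, \alpha_i, \beta_i), \tag{20}
\]
\[
\sum_{i=1}^{q'} \lambda_i P_{1i} \le P_1, \tag{21}
\]
\[
\sum_{i=1}^{q'} \lambda_i P_{2i} \le P_2, \tag{22}
\]
\[
\sum_{i=1}^{q'} \lambda_i = 1, \tag{23}
\]
\[
\lambda_i \ge 0,\ (\alpha_i, \beta_i) \in [0, 1]^2;\ \forall\, i \in \{1, \ldots, q'\}. \tag{24}
\]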

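It is easy to show that $\mathscr{G}_2$ is a closed, bounded and convex region. In fact, $\mathscr{G}_2$ is obtained by using the simple method of TD/FD.
To see this, let us divide the available frequency band into $q'$ sub-bands where $\lambda_i$ represents the length of the $i$'th band and
$\sum_{i=1}^{q'} \lambda_i = 1$. Users 1 and 2 allocate $P_{1i}$ and $P_{2i}$ in the $i$'th sub-band, respectively. Therefore, all rate pairs in $\mathscr{G}_0(P_{1i}, P_{2i}, \alpha_i, \beta_i)$
are achievable in the $i$'th sub-band for fixed $(\alpha_i, \beta_i) \in [0, 1]^2$. Hence, all rate pairs in $\sum_{i=1}^{q'} \lambda_i \mathscr{G}_0(P_{1i}, P_{2i}, \alpha_i, \beta_i)$ are achievable
provided that $\sum_{i=1}^{q'} \lambda_i P_{1i} \le P_1$ and $\sum_{i=1}^{q'} \lambda_i P_{2i} \le P_2$.

Clearly, the chain of inclusions $\mathscr{G}_0 \subseteq \mathscr{G}_1 \subseteq \mathscr{G}_2 \subseteq \mathscr{G} \subseteq \mathscr{C}_{HK} \subseteq \mathscr{C}$ always holds.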
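The TD/FD construction can be illustrated with the simplest point of the basic region, namely the rate pair obtained with all power private ($\alpha_i = \beta_i = 1$) and interference treated as noise in every sub-band. The sketch below is an assumption-laden toy: the function names and the symmetric parameters are hypothetical, and only this one corner of each sub-band region is combined.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def tin_rates(p1, p2, a, b):
    """Rate pair with private messages only and interference treated as noise."""
    return gamma(p1 / (1.0 + a * p2)), gamma(p2 / (1.0 + b * p1))


def td_fd_rates(bands, p1, p2, a, b, tol=1e-9):
    """Combine sub-band rate pairs as sum_i lam_i * R_i.

    bands is a list of (lam_i, p1_i, p2_i); the constraints (21)-(24) on the
    fractions and on the average powers are checked before combining.
    """
    lams = [lam for lam, _, _ in bands]
    if any(lam < 0 for lam in lams) or abs(sum(lams) - 1.0) > tol:
        raise ValueError("band fractions must be nonnegative and sum to one")
    if sum(lam * q1 for lam, q1, _ in bands) > p1 + tol:
        raise ValueError("average power of user 1 exceeds P1")
    if sum(lam * q2 for lam, _, q2 in bands) > p2 + tol:
        raise ValueError("average power of user 2 exceeds P2")
    r1 = sum(lam * tin_rates(q1, q2, a, b)[0] for lam, q1, q2 in bands)
    r2 = sum(lam * tin_rates(q1, q2, a, b)[1] for lam, q1, q2 in bands)
    return r1, r2


# Symmetric example: P1 = P2 = 10, a = b = 0.9.
single_band = td_fd_rates([(1.0, 10.0, 10.0)], 10.0, 10.0, 0.9, 0.9)
two_bands = td_fd_rates([(0.5, 20.0, 0.0), (0.5, 0.0, 20.0)], 10.0, 10.0, 0.9, 0.9)
```

In this example the orthogonal two-band allocation gives a larger sum rate than the single-band point, which shows how the weighted combination over sub-bands can enlarge the region reachable at a fixed pair of powers.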



D. Concavification Versus Time-Sharing 

In this subsection, we follow two objectives. First, we aim at providing some sufficient conditions such that $\mathscr{G}_2 = \mathscr{G}$.
Second, we bound $q$ and $q'$ which are parameters involved in the descriptions of $\mathscr{G}$ and $\mathscr{G}_2$, respectively. However, we derive
the required conditions for the more general case where there are $M$ users in the system. To this end, assume an achievable
scheme for an $M$-user channel with the power constraint $\mathbf{P} = [P_1, P_2, \ldots, P_M]$ is given. The corresponding achievable region
can be represented as

\[
D_0(\mathbf{P}, \boldsymbol{\Theta}) = \{ \mathbf{R} \mid A\mathbf{R} \le \boldsymbol{\Psi}(\mathbf{P}, \boldsymbol{\Theta}) \}, \tag{25}
\]

where $A$ is a $K \times M$ matrix and $\boldsymbol{\Theta} \in [0, 1]^M$. $D_0$ is a polyhedron in general, but for the purpose of this paper, it suffices to
assume that it is a polytope. Since $D_0$ is a convex region, the convex hull operation does not lead to a new enlarged region.
However, if the extreme points of the region are not a concave function of $\mathbf{P}$, it is possible to enlarge $D_0$ by using two
different methods which are explained next. The first method is based on using the time sharing parameter. Let us denote the
corresponding region as $D$ which can be written as

\[
D = \left\{ \mathbf{R} \,\middle|\, A\mathbf{R} \le \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i),\ \sum_{i=1}^{q} \lambda_i \mathbf{P}_i \le \mathbf{P},\ \sum_{i=1}^{q} \lambda_i = 1,\ \lambda_i \ge 0,\ \boldsymbol{\Theta}_i \in [0, 1]^M\ \forall\, i \right\}, \tag{26}
\]

where $q \in \mathbb{N}$.

In the second method, we use TD/FD to enlarge the achievable rate region. This results in an achievable region $D_2$ represented
as

\[
D_2 = \left\{ \mathbf{R} = \sum_{i=1}^{q'} \lambda_i \mathbf{R}_i \,\middle|\, A\mathbf{R}_i \le \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i),\ \sum_{i=1}^{q'} \lambda_i \mathbf{P}_i \le \mathbf{P},\ \sum_{i=1}^{q'} \lambda_i = 1,\ \lambda_i \ge 0,\ \boldsymbol{\Theta}_i \in [0, 1]^M\ \forall\, i \right\}, \tag{27}
\]

where $q' \in \mathbb{N}$. We refer to this method as concavification. It can be readily shown that $D$ and $D_2$ are closed and convex, and
$D_2 \subseteq D$. We are interested in situations where the inverse inclusion holds.
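The support function of $D_0$ is a function of $\mathbf{P}$, $\boldsymbol{\Theta}$, and $\mathbf{c}$. Hence, we have

\[
\sigma_{D_0}(\mathbf{c}, \mathbf{P}, \boldsymbol{\Theta}) = \max \{ \mathbf{c}^t \mathbf{R} \mid A\mathbf{R} \le \boldsymbol{\Psi}(\mathbf{P}, \boldsymbol{\Theta}) \}. \tag{28}
\]

For fixed $\mathbf{P}$ and $\boldsymbol{\Theta}$, (28) is a linear program. Using strong duality of linear programming, we obtain

\[
\sigma_{D_0}(\mathbf{c}, \mathbf{P}, \boldsymbol{\Theta}) = \min \{ \mathbf{y}^t \boldsymbol{\Psi}(\mathbf{P}, \boldsymbol{\Theta}) \mid A^t \mathbf{y} = \mathbf{c},\ \mathbf{y} \ge 0 \}. \tag{29}
\]

In general, $\hat{\mathbf{y}}$, the minimizer of (29), is a function of $\mathbf{P}$, $\boldsymbol{\Theta}$, and $\mathbf{c}$. We say $D_0$ possesses the unique minimizer property if
$\hat{\mathbf{y}}$ merely depends on $\mathbf{c}$, for all $\mathbf{c}$. In this case, we have

\[
\sigma_{D_0}(\mathbf{c}, \mathbf{P}, \boldsymbol{\Theta}) = \hat{\mathbf{y}}^t(\mathbf{c}) \boldsymbol{\Psi}(\mathbf{P}, \boldsymbol{\Theta}), \tag{30}
\]

where $A^t \hat{\mathbf{y}} = \mathbf{c}$. This condition means that for any $\mathbf{c}$ the extreme point of $D_0$ maximizing the objective $\mathbf{c}^t \mathbf{R}$ is an extreme
point obtained by intersecting a set of specific hyperplanes. A necessary condition for $D_0$ to possess the unique minimizer
property is that each inequality in describing $D_0$ is either redundant or active for all $\mathbf{P}$ and $\boldsymbol{\Theta}$.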
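The primal problem (28) and its dual (29) can be compared numerically in the two-user case, where both are small enough to solve by enumeration. In the sketch below the matrix $A$ is the one of the two-user HK description, while the bound vector `psi` and the direction `c` are hypothetical numbers chosen only for illustration; the primal is solved over vertices and the dual over basic solutions with two nonzero entries.

```python
from itertools import combinations

# Rows of A for the two-user HK description; psi and c below are toy values.
A = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0), (1.0, 2.0)]


def solve2(m, rhs):
    """Solve a 2x2 linear system m x = rhs by Cramer's rule (None if singular)."""
    (a11, a12), (a21, a22) = m
    det = a11 * a22 - a12 * a21
    if abs(det) < 1e-12:
        return None
    return ((rhs[0] * a22 - a12 * rhs[1]) / det,
            (a11 * rhs[1] - rhs[0] * a21) / det)


def primal(c, psi, tol=1e-9):
    """max c^t R subject to A R <= psi, by enumerating the vertices."""
    best = None
    for i, j in combinations(range(len(A)), 2):
        r = solve2((A[i], A[j]), (psi[i], psi[j]))
        if r is None:
            continue
        feasible = all(row[0] * r[0] + row[1] * r[1] <= bound + tol
                       for row, bound in zip(A, psi))
        if feasible:
            value = c[0] * r[0] + c[1] * r[1]
            best = value if best is None else max(best, value)
    return best


def dual(c, psi, tol=1e-9):
    """min y^t psi subject to A^t y = c, y >= 0, over two-entry basic solutions."""
    best, best_y = None, None
    for i, j in combinations(range(len(A)), 2):
        # Solve y_i * A[i] + y_j * A[j] = c for the two active multipliers.
        y = solve2(((A[i][0], A[j][0]), (A[i][1], A[j][1])), c)
        if y is None or min(y) < -tol:
            continue
        value = y[0] * psi[i] + y[1] * psi[j]
        if best is None or value < best:
            best, best_y = value, {i: y[0], j: y[1]}
    return best, best_y


psi = (2.0, 1.5, 3.0, 4.8, 4.5)
c = (1.5, 1.0)
primal_value = primal(c, psi)
dual_value, y_hat = dual(c, psi)
```

For these numbers the maximizing vertex is the intersection of the third and fourth inequalities, and the dual minimizer puts weight $0.5$ on each of them. Whether the same pair of inequalities stays active as the bounds change is exactly what the unique minimizer property asks for.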
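Theorem 1: If $D_0$ possesses the unique minimizer property, then $D = D_2$.

Proof: Since $D_2 \subseteq D$ always holds, we need to show $D \subseteq D_2$ which can be equivalently verified by showing $\sigma_D \le \sigma_{D_2}$.
The support function of $D$ can be written as

\[
\sigma_D(\mathbf{c}, \mathbf{P}) = \max \{ \mathbf{c}^t \mathbf{R} \mid \mathbf{R} \in D \}. \tag{31}
\]

By fixing $\mathbf{P}$, $\mathbf{P}_i$'s, $\boldsymbol{\Theta}_i$'s, and $\lambda_i$'s, the above maximization becomes a linear program. Hence, relying on weak duality of linear
programming, we obtain

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \min_{A^t \mathbf{y} = \mathbf{c},\ \mathbf{y} \ge 0} \mathbf{y}^t \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i). \tag{32}
\]

Clearly, $\hat{\mathbf{y}}(\mathbf{c})$, the solution of (29), is a feasible point for (32) and we have

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \hat{\mathbf{y}}^t(\mathbf{c}) \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i). \tag{33}
\]

Using (30), we obtain

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \sum_{i=1}^{q} \lambda_i \sigma_{D_0}(\mathbf{c}, \mathbf{P}_i, \boldsymbol{\Theta}_i). \tag{34}
\]

Let us assume $\hat{\mathbf{R}}_i$ is the maximizer of (28) for the parameters $(\mathbf{P}_i, \boldsymbol{\Theta}_i)$. In this case, we have

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \sum_{i=1}^{q} \lambda_i \mathbf{c}^t \hat{\mathbf{R}}_i. \tag{35}
\]

Hence, we have

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \mathbf{c}^t \sum_{i=1}^{q} \lambda_i \hat{\mathbf{R}}_i. \tag{36}
\]

By definition, $\sum_{i=1}^{q} \lambda_i \hat{\mathbf{R}}_i$ is a point in $D_2$. Therefore, we conclude

\[
\sigma_D(\mathbf{c}, \mathbf{P}) \le \sigma_{D_2}(\mathbf{c}, \mathbf{P}). \tag{37}
\]

This completes the proof. □

Corollary 1 (Han [18]): If $D_0$ is a polymatroid, then $D = D_2$.

Proof: It is easy to show that $D_0$ possesses the unique minimizer property. In fact, for given $\mathbf{c}$, $\hat{\mathbf{y}}$ can be obtained in a
greedy fashion independent of $\mathbf{P}$ and $\boldsymbol{\Theta}$. □

In what follows, we upper bound $q$ and $q'$.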

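Theorem 2: The cardinality of the time sharing parameter $q$ in (26) is at most $M + K + 1$, where $M$ and $K$ are the
dimensions of $\mathbf{P}$ and $\boldsymbol{\Psi}(\mathbf{P})$, respectively. Moreover, if $\boldsymbol{\Psi}(\mathbf{P})$ is a continuous function of $\mathbf{P}$, then $q \le M + K$.

Proof: Let us define $E$ as

\[
E = \left\{ \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i) \,\middle|\, \sum_{i=1}^{q} \lambda_i \mathbf{P}_i \le \mathbf{P},\ \sum_{i=1}^{q} \lambda_i = 1,\ \lambda_i \ge 0,\ \boldsymbol{\Theta}_i \in [0, 1]^M\ \forall\, i \right\}. \tag{38}
\]

In fact, $E$ is the collection of all possible bounds for $D$. To prove $q \le M + K + 1$, we define another region $E_1$ as

\[
E_1 = \{ (\mathbf{P}', \mathbf{S}') \mid 0 \le \mathbf{P}',\ \mathbf{S}' = \boldsymbol{\Psi}(\mathbf{P}', \boldsymbol{\Theta}'),\ \boldsymbol{\Theta}' \in [0, 1]^M \}. \tag{39}
\]

As a direct consequence of Carathéodory's theorem [19], the convex hull of $E_1$, denoted by $\mathrm{conv}\, E_1$, can be obtained
by convex combinations of no more than $M + K + 1$ points in $E_1$. Moreover, if $\boldsymbol{\Psi}(\mathbf{P}', \boldsymbol{\Theta}')$ is continuous, then $M + K$ points
are sufficient due to the extension of Carathéodory's theorem [19]. Now, we define the region $\hat{E}$ as

\[
\hat{E} = \{ \mathbf{S}' \mid (\mathbf{P}', \mathbf{S}') \in \mathrm{conv}\, E_1,\ \mathbf{P}' \le \mathbf{P} \}. \tag{40}
\]

Clearly, $\hat{E} \subseteq E$. To show the other inclusion, let us consider a point in $E$, say $\mathbf{S} = \sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i)$. Since $(\mathbf{P}_i, \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i))$
is a point in $E_1$, $\sum_{i=1}^{q} \lambda_i (\mathbf{P}_i, \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i))$ belongs to $\mathrm{conv}\, E_1$. Having $\sum_{i=1}^{q} \lambda_i \mathbf{P}_i \le \mathbf{P}$, we conclude $\sum_{i=1}^{q} \lambda_i \boldsymbol{\Psi}(\mathbf{P}_i, \boldsymbol{\Theta}_i) \in \hat{E}$.

Hence, $E \subseteq \hat{E}$. This completes the proof. □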

Corollary 2 (Etkin, Parekh, and Tse [20]): For the $M$-user Gaussian IC where users use Gaussian codebooks for data
transmission and treat the interference as noise, the cardinality of the time sharing parameter is at most $2M$.

Proof: In this case, $D_0 = \{ \mathbf{R} \mid \mathbf{R} \le \boldsymbol{\Psi}(\mathbf{P}) \}$ where both $\mathbf{P}$ and $\boldsymbol{\Psi}(\mathbf{P})$ have dimension $M$ and $\boldsymbol{\Psi}(\mathbf{P})$ is a continuous
function of $\mathbf{P}$. Applying Theorem 2 yields the desired result. □

In the following theorem, we obtain an upper bound on $q'$.

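Theorem 3: To characterize boundary points of $D_2$, it suffices to set $q' \le M + 1$.

Proof: Let us assume $\hat{\mathbf{R}}$ is a boundary point of $D_2$. Hence, there exists $\mathbf{c}$ such that

\[
\sigma_{D_2}(\mathbf{c}, \mathbf{P}) = \max_{\mathbf{R} \in D_2} \mathbf{c}^t \mathbf{R} = \mathbf{c}^t \hat{\mathbf{R}}, \tag{41}
\]

where $\hat{\mathbf{R}} = \sum_{i=1}^{q'} \hat{\lambda}_i \hat{\mathbf{R}}_i$ and the optimum is achieved for the set of parameters $\hat{\boldsymbol{\Theta}}_i$, $\hat{\lambda}_i$, and $\hat{\mathbf{P}}_i$. The optimization problem in
(41) can be written as

\[
\begin{aligned}
\sigma_{D_2}(\mathbf{c}, \mathbf{P}) = \max\ & \sum_{i=1}^{q'} \lambda_i\, g(\mathbf{c}, \mathbf{P}_i) \\
\text{subject to: } & \sum_{i=1}^{q'} \lambda_i = 1,\ \sum_{i=1}^{q'} \lambda_i \mathbf{P}_i \le \mathbf{P}, \\
& 0 \le \lambda_i,\ 0 \le \mathbf{P}_i,\ \forall\, i \in \{1, 2, \ldots, q'\},
\end{aligned} \tag{42}
\]

where $g(\mathbf{c}, \mathbf{P})$ is defined as

\[
\begin{aligned}
g(\mathbf{c}, \mathbf{P}) = \max\ & \mathbf{c}^t \mathbf{R} \\
\text{subject to: } & A\mathbf{R} \le \boldsymbol{\Psi}(\mathbf{P}, \boldsymbol{\Theta}),\ 0 \le \boldsymbol{\Theta} \le 1.
\end{aligned} \tag{43}
\]

In fact, $\sigma_{D_2}(\mathbf{c}, \mathbf{P})$ in (42) can be viewed as the result of the concavification of $g(\mathbf{c}, \mathbf{P})$ [19]. Hence, using Theorem 2.16 in
[19], we conclude that $q' \le M + 1$. □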
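The concavification step in (42) can be visualized in one dimension ($M = 1$), where Theorem 3 says that two time-sharing points are enough. The sketch below uses a hypothetical S-shaped function `g_toy` (it is not a rate function from the paper) and a plain grid search over pairs of powers whose weighted average equals the available power.

```python
def concave_envelope_at(g, p, p_max, steps=400):
    """Upper concave envelope of g at power p, restricted to a grid on [0, p_max].

    Searches over two-point combinations lam*g(pa) + (1-lam)*g(pb) with
    lam*pa + (1-lam)*pb = p, which suffices in one dimension (M + 1 = 2 points).
    """
    grid = [p_max * k / steps for k in range(steps + 1)]
    best = g(p)
    for pa in grid:
        if pa > p:
            break
        for pb in grid:
            if pb <= p:
                continue
            lam = (pb - p) / (pb - pa)  # weight on pa so the average power is p
            best = max(best, lam * g(pa) + (1.0 - lam) * g(pb))
    return best


def g_toy(p):
    """Hypothetical non-concave function of the power, convex near the origin."""
    return p * p / (1.0 + p * p)


plain_value = g_toy(0.3)
shared_value = concave_envelope_at(g_toy, 0.3, 4.0)
```

For this toy function the best pair is $(0, 1)$: staying silent for a fraction $0.7$ of the time and using power $1$ otherwise gives $0.15$, compared with $g(0.3) \approx 0.083$ at constant power.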

A remarkable point about Theorem 3 is that the upper bound on $q'$ is independent of the number of inequalities involved in
the description of the achievable rate region.

Corollary 3: For the $M$-user Gaussian IC where users use Gaussian codebooks and treat the interference as noise, we have
$D_2 = D$ and $q = q' = M + 1$.

E. Extremal Inequality

In [14], the following optimization problem is studied:

\[
W = \max_{Q_{\mathbf{X}} \preceq S} h(\mathbf{X} + \mathbf{Z}_1) - \mu h(\mathbf{X} + \mathbf{Z}_2), \tag{44}
\]

where $\mathbf{Z}_1$ and $\mathbf{Z}_2$ are $n$-dimensional Gaussian random vectors with the strictly positive definite covariance matrices $Q_{\mathbf{Z}_1}$
and $Q_{\mathbf{Z}_2}$, respectively. The optimization is over all random vectors $\mathbf{X}$ independent of $\mathbf{Z}_1$ and $\mathbf{Z}_2$. $\mathbf{X}$ is also subject to the
covariance matrix constraint $Q_{\mathbf{X}} \preceq S$, where $S$ is a positive definite matrix. In [14], it is shown that for all $\mu \ge 1$, this
optimization problem has a Gaussian optimal solution for all positive definite matrices $Q_{\mathbf{Z}_1}$ and $Q_{\mathbf{Z}_2}$. However, for $0 \le \mu < 1$
this optimization problem has a Gaussian optimal solution provided $Q_{\mathbf{Z}_1} \preceq Q_{\mathbf{Z}_2}$, i.e., $Q_{\mathbf{Z}_2} - Q_{\mathbf{Z}_1}$ is a positive semi-definite
matrix. It is worth noting that for $\mu = 1$ this problem when $Q_{\mathbf{Z}_1} \preceq Q_{\mathbf{Z}_2}$ is studied under the name of the worst additive noise
[21], [22].

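In this paper, we consider a special case of (44) where $\mathbf{Z}_1$ and $\mathbf{Z}_2$ have the covariance matrices $N_1 I$ and $N_2 I$, respectively,
and the trace constraint is considered, i.e.,

\[
W = \max_{\mathrm{tr}\{Q_{\mathbf{X}}\} \le nP} h(\mathbf{X} + \mathbf{Z}_1) - \mu h(\mathbf{X} + \mathbf{Z}_2). \tag{45}
\]

In the following lemma, we provide the optimal solution for the above optimization problem when $N_1 \le N_2$.

Lemma 1: If $N_1 \le N_2$, the optimal solution of (45) is iid Gaussian for all $0 \le \mu$ and we have

1) For $0 \le \mu \le \frac{N_2 + P}{N_1 + P}$, the optimum covariance matrix is $PI$ and the optimum solution is

\[
W = \frac{n}{2} \log \left[ (2\pi e)(P + N_1) \right] - \frac{\mu n}{2} \log \left[ (2\pi e)(P + N_2) \right]. \tag{46}
\]

2) For $\frac{N_2 + P}{N_1 + P} < \mu \le \frac{N_2}{N_1}$, the optimum covariance matrix is $\frac{N_2 - \mu N_1}{\mu - 1} I$ and the optimum solution is

\[
W = \frac{n}{2} \log \left[ (2\pi e) \frac{N_2 - N_1}{\mu - 1} \right] - \frac{\mu n}{2} \log \left[ \frac{\mu (2\pi e)(N_2 - N_1)}{\mu - 1} \right]. \tag{47}
\]

3) For $\frac{N_2}{N_1} < \mu$, the optimum covariance matrix is $0$ and the optimum solution is

\[
W = \frac{n}{2} \log(2\pi e N_1) - \frac{\mu n}{2} \log(2\pi e N_2). \tag{48}
\]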
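Proof: From the general result for (44), we know that the optimum input distribution is Gaussian. Hence, we need to
solve the following maximization problem:

\[
\begin{aligned}
W = \max\ & \frac{1}{2} \log \left( (2\pi e)^n |Q_{\mathbf{X}} + N_1 I| \right) - \frac{\mu}{2} \log \left( (2\pi e)^n |Q_{\mathbf{X}} + N_2 I| \right) \\
\text{subject to: } & 0 \preceq Q_{\mathbf{X}},\ \mathrm{tr}\{Q_{\mathbf{X}}\} \le nP.
\end{aligned} \tag{49}
\]

Since $Q_{\mathbf{X}}$ is a positive semi-definite matrix, it can be decomposed as $Q_{\mathbf{X}} = U \Lambda U^t$, where $\Lambda$ is a diagonal matrix with
nonnegative entries and $U$ is a unitary matrix, i.e., $U U^t = I$. Substituting $Q_{\mathbf{X}} = U \Lambda U^t$ in (49) and using the identities
$\mathrm{tr}\{AB\} = \mathrm{tr}\{BA\}$ and $|AB + I| = |BA + I|$, we obtain

\[
\begin{aligned}
W = \max\ & \frac{1}{2} \log \left( (2\pi e)^n |\Lambda + N_1 I| \right) - \frac{\mu}{2} \log \left( (2\pi e)^n |\Lambda + N_2 I| \right) \\
\text{subject to: } & 0 \preceq \Lambda,\ \mathrm{tr}\{\Lambda\} \le nP.
\end{aligned} \tag{50}
\]

This optimization problem can be simplified as

\[
\begin{aligned}
W = \max\ & \frac{1}{2} \sum_{i=1}^{n} \left[ \log (2\pi e)(\lambda_i + N_1) - \mu \log (2\pi e)(\lambda_i + N_2) \right] \\
\text{subject to: } & 0 \le \lambda_i\ \forall\, i,\ \sum_{i=1}^{n} \lambda_i \le nP.
\end{aligned} \tag{51}
\]

By introducing Lagrange multipliers $\psi$ and $\Phi = \{\phi_1, \phi_2, \ldots, \phi_n\}$, we obtain

\[
L(\Lambda, \psi, \Phi) = \frac{1}{2} \sum_{i=1}^{n} \left[ \log (2\pi e)(\lambda_i + N_1) - \mu \log (2\pi e)(\lambda_i + N_2) \right] + \psi \left( nP - \sum_{i=1}^{n} \lambda_i \right) + \sum_{i=1}^{n} \phi_i \lambda_i. \tag{52}
\]

The first order KKT necessary conditions for the optimum solution of (52) can be written as

\[
\frac{1}{2} \left( \frac{1}{\lambda_i + N_1} - \frac{\mu}{\lambda_i + N_2} \right) - \psi + \phi_i = 0,\ \forall\, i \in \{1, 2, \ldots, n\}, \tag{53}
\]
\[
\psi \left( nP - \sum_{i=1}^{n} \lambda_i \right) = 0, \tag{54}
\]
\[
\phi_i \lambda_i = 0,\ \forall\, i \in \{1, 2, \ldots, n\}. \tag{55}
\]

It is easy to show that when $N_1 \le N_2$, $\lambda = \lambda_1 = \ldots = \lambda_n$ and the only solution for $\lambda$ is

\[
\lambda =
\begin{cases}
P, & \text{if } 0 \le \mu \le \frac{P + N_2}{P + N_1}, \\
\frac{N_2 - \mu N_1}{\mu - 1}, & \text{if } \frac{P + N_2}{P + N_1} < \mu \le \frac{N_2}{N_1}, \\
0, & \text{if } \frac{N_2}{N_1} < \mu.
\end{cases} \tag{56}
\]

Substituting $\lambda$ into the objective function gives the desired result. □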
In Figure 2, the optimum variance as a function of $\mu$ is plotted. This figure shows that for any value of $\mu \le \frac{P + N_2}{P + N_1}$, we
need to use the maximum power to optimize the objective function, whereas for $\mu > \frac{P + N_2}{P + N_1}$, we use less power than what is
permissible.
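The three regimes of the optimum variance can be checked numerically for a single dimension. The sketch below is illustrative only: the function names and the parameters $P = 3$, $N_1 = 1$, $N_2 = 2$ are assumptions, natural logarithms are used, and the brute-force search is restricted to Gaussian inputs, so it confirms the closed form only within that family.

```python
import math


def optimal_variance(p, n1, n2, mu):
    """Closed-form per-dimension optimum variance, assuming n1 <= n2 and mu >= 0."""
    if n1 > n2:
        raise ValueError("this closed form assumes N1 <= N2")
    if mu <= (p + n2) / (p + n1):
        return p                               # full power
    if mu <= n2 / n1:
        return (n2 - mu * n1) / (mu - 1.0)     # interior stationary point
    return 0.0                                 # no power


def objective(lam, n1, n2, mu):
    """0.5*log(2*pi*e*(lam + N1)) - 0.5*mu*log(2*pi*e*(lam + N2)), natural log."""
    k = 2.0 * math.pi * math.e
    return 0.5 * math.log(k * (lam + n1)) - 0.5 * mu * math.log(k * (lam + n2))


def brute_force_variance(p, n1, n2, mu, steps=30000):
    """Grid search for the maximizing variance on [0, p]."""
    grid = [p * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda lam: objective(lam, n1, n2, mu))


# Thresholds for P = 3, N1 = 1, N2 = 2 are 5/4 and 2.
closed = [optimal_variance(3.0, 1.0, 2.0, mu) for mu in (1.0, 1.5, 3.0)]
numeric = [brute_force_variance(3.0, 1.0, 2.0, mu) for mu in (1.0, 1.5, 3.0)]
```

With these parameters the three values of $\mu$ fall in the three different regimes, so the closed form returns full power, an interior value, and zero power, respectively.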

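Lemma 2: If $N_1 > N_2$, the optimal solution of (45) is iid Gaussian for all $1 \le \mu$. In this case, the optimum variance is $0$
and the optimum $W$ is

\[
W = \frac{n}{2} \log(2\pi e N_1) - \frac{\mu n}{2} \log(2\pi e N_2). \tag{57}
\]

Proof: The proof is similar to that of Lemma 1 and we omit it here. □

Corollary 4: For $\mu = 1$, the optimal solution of (45) is iid Gaussian and the optimum $W$ is

\[
W =
\begin{cases}
\frac{n}{2} \log \left( \frac{P + N_1}{P + N_2} \right), & \text{if } N_1 \le N_2, \\
\frac{n}{2} \log \left( \frac{N_1}{N_2} \right), & \text{if } N_1 > N_2.
\end{cases} \tag{58}
\]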

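We frequently apply the following optimization problem in the rest of the paper:

\[
f_h(P, N_1, N_2, a, \mu) = \max_{\mathrm{tr}\{Q_{\mathbf{X}}\} \le nP} h(\mathbf{X} + \mathbf{Z}_1) - \mu h(\sqrt{a}\, \mathbf{X} + \mathbf{Z}_2), \tag{59}
\]

where $N_1 \le N_2 / a$. Using the identity $h(A\mathbf{X}) = \log(|A|) + h(\mathbf{X})$, (59) can be written as

\[
f_h(P, N_1, N_2, a, \mu) = -\frac{\mu n}{2} \log a + \max_{\mathrm{tr}\{Q_{\mathbf{X}}\} \le nP} h(\mathbf{X} + \mathbf{Z}_1) - \mu h\left( \mathbf{X} + \frac{\mathbf{Z}_2}{\sqrt{a}} \right). \tag{60}
\]

Now, Lemma 1 can be applied to obtain

\[
f_h(P, N_1, N_2, a, \mu) =
\begin{cases}
\frac{n}{2} \log \left[ (2\pi e)(P + N_1) \right] - \frac{\mu n}{2} \log \left[ (2\pi e)(aP + N_2) \right], & \text{if } 0 \le \mu \le \frac{P + N_2/a}{P + N_1}, \\
\frac{n}{2} \log \left[ (2\pi e) \frac{N_2/a - N_1}{\mu - 1} \right] - \frac{\mu n}{2} \log \left[ \frac{a \mu (2\pi e)(N_2/a - N_1)}{\mu - 1} \right], & \text{if } \frac{P + N_2/a}{P + N_1} < \mu \le \frac{N_2}{a N_1}, \\
\frac{n}{2} \log(2\pi e N_1) - \frac{\mu n}{2} \log(2\pi e N_2), & \text{if } \frac{N_2}{a N_1} < \mu.
\end{cases} \tag{61}
\]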
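The closed form of $f_h$ follows from Lemma 1 by replacing $N_2$ with $N_2 / a$ and accounting for the term $-\frac{\mu n}{2} \log a$. The sketch below evaluates that closed form for $n = 1$ with natural logarithms and compares it with a grid search over the variance of a Gaussian input. The function names and the test parameters are hypothetical, and the numerical search covers Gaussian inputs only.

```python
import math

K = 2.0 * math.pi * math.e


def f_h(p, n1, n2, a, mu):
    """Closed form of f_h for n = 1 (natural log), assuming n1 <= n2 / a, mu >= 0."""
    if n1 > n2 / a:
        raise ValueError("this closed form assumes N1 <= N2 / a")
    if mu <= (p + n2 / a) / (p + n1):
        # Full power: variance P at the input.
        return 0.5 * math.log(K * (p + n1)) - 0.5 * mu * math.log(K * (a * p + n2))
    if mu <= n2 / (a * n1):
        # Interior optimum: lam + N1 equals d, and a*lam + N2 equals a*mu*d.
        d = (n2 / a - n1) / (mu - 1.0)
        return 0.5 * math.log(K * d) - 0.5 * mu * math.log(a * mu * K * d)
    # Zero power.
    return 0.5 * math.log(K * n1) - 0.5 * mu * math.log(K * n2)


def f_h_numeric(p, n1, n2, a, mu, steps=20000):
    """Grid search over the variance lam in [0, p] of a scalar Gaussian input."""
    return max(
        0.5 * math.log(K * (lam + n1)) - 0.5 * mu * math.log(K * (a * lam + n2))
        for lam in (p * k / steps for k in range(steps + 1))
    )


# P = 3, N1 = 1, N2 = 1, a = 0.5 gives thresholds 5/4 and 2 on mu.
params = (3.0, 1.0, 1.0, 0.5)
gaps = [abs(f_h(*params, mu) - f_h_numeric(*params, mu)) for mu in (1.0, 1.5, 3.0)]
```

The three values of $\mu$ again probe the three branches, and the gaps between the closed form and the grid search are at the level of the grid resolution.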
Fig. 3. An admissible channel. $f_1$ and $f_2$ are deterministic functions.



III. Admissible Channels 

In this section, we aim at building ICs whose capacity regions contain the capacity region of the two-user Gaussian IC, i.e., $\mathscr{C}$.
Since we ultimately use these to outer bound $\mathscr{C}$, these ICs need to have a tractable expression (or a tractable outer bound)
for their capacity regions.

Let us consider an IC with the same input letters as that of $\mathscr{C}$ and the output letters $\tilde{y}_1$ and $\tilde{y}_2$ for Users 1 and 2, respectively.
The capacity region of this channel, say $\mathscr{C}'$, contains $\mathscr{C}$ if

\[
I(x_1^n; y_1^n) \le I(x_1^n; \tilde{y}_1^n), \tag{62}
\]
\[
I(x_2^n; y_2^n) \le I(x_2^n; \tilde{y}_2^n), \tag{63}
\]

for all $p(x_1^n) p(x_2^n)$ and for all $n \in \mathbb{N}$.

One way to satisfy (|62] | and (|63] | is to provide some extra information to either one or to both receivers. This technique 
is known as Genie aided outer bounding. In [12], Kramer has used such a genie to provide some extra information to both 
receivers such that they can decode both users' messages. Since the capacity region of this new interference channel is equivalent 
to that of the Compound Multiple Access Channel whose capacity region is known, reference [12] obtains an outer bound 
on the capacity region. To obtain a tighter outer bound, reference [12] further uses the fact that if a genie provides the exact 
information about the interfering signal to one of the receivers, then the new channel becomes the one-sided Gaussian IC. 
Although the capacity region of the one-sided Gaussian IC is unknown for all ranges of parameters, there exists an outer bound 
for it due to Sato and Costa [23], [11] that can be applied to the original channel. In [13], Etkin et al. use a different genie 
that provides some extra information about the intended signal. Even though at first glance their proposed method appears to 
be far from achieving a tight bound, remarkably they show that the corresponding bound is tighter than the one due to Kramer 
for certain ranges of parameters. 

Next, we introduce the notion of admissible channels to satisfy (62) and (63).

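Definition 3 (Admissible Channel): An IC $\mathscr{C}'$ with input letter $x_i$ and output letter $\tilde{y}_i$ for User $i \in \{1, 2\}$ is an admissible channel if there exist two deterministic functions $\hat{y}_1^n = f_1(\tilde{y}_1^n)$ and $\hat{y}_2^n = f_2(\tilde{y}_2^n)$ such that

$$I(x_1^n; y_1^n) \le I(x_1^n; \hat{y}_1^n), \tag{64}$$
$$I(x_2^n; y_2^n) \le I(x_2^n; \hat{y}_2^n) \tag{65}$$

hold for all $p(x_1^n)p(x_2^n)$ and for all $n \in \mathbb{N}$. $\mathscr{E}$ denotes the collection of all admissible channels (see Figure 3).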

Remark 2: Genie-aided channels are among admissible channels. To see this, let us assume a genie provides $s_1$ and $s_2$ as side information for User 1 and 2, respectively. In this case, $\tilde{y}_i = (y_i, s_i)$ for $i \in \{1, 2\}$. By choosing $f_i(y_i, s_i) = y_i$, we observe that $\hat{y}_i = y_i$, and hence, (64) and (65) trivially hold.

To obtain the tightest outer bound, we need to find the intersection of the capacity regions of all admissible channels. Nonetheless, it may happen that finding the capacity region of an admissible channel is as hard as that of the original one (in fact, based on the definition, the channel itself is one of its admissible channels). Hence, we need to find classes of admissible channels, say $\mathscr{F}$, which possess two important properties. First, their capacity regions are close to $\mathscr{C}$. Second, either their exact capacity regions are computable or there exist good outer bounds for them. Since $\mathscr{F} \subseteq \mathscr{E}$, we have

$$\mathscr{C} \subseteq \bigcap_{\mathscr{F}} \mathscr{C}'. \tag{66}$$

Recall that there is a one to one correspondence between a closed convex set and its support function. Since $\mathscr{C}$ is closed and convex, there is a one to one correspondence between $\mathscr{C}$ and $\sigma_{\mathscr{C}}$. In fact, boundary points of $\mathscr{C}$ correspond to the solutions of the following optimization problem

$$\sigma_{\mathscr{C}}(c_1, c_2) = \max_{(R_1, R_2) \in \mathscr{C}} c_1 R_1 + c_2 R_2. \tag{67}$$

Since we are interested in the boundary points excluding the $R_1$ and $R_2$ axes, it suffices to consider $0 \le c_1$ and $0 \le c_2$ where $c_1 + c_2 = 1$.
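Since $\mathscr{C} \subseteq \mathscr{C}'$, we have

$$\sigma_{\mathscr{C}}(c_1, c_2) \le \sigma_{\mathscr{C}'}(c_1, c_2). \tag{68}$$

Taking the minimum of the right hand side, we obtain

$$\sigma_{\mathscr{C}}(c_1, c_2) \le \min_{\mathscr{C}' \in \mathscr{F}} \sigma_{\mathscr{C}'}(c_1, c_2), \tag{69}$$

which can be written as

$$\sigma_{\mathscr{C}}(c_1, c_2) \le \min_{\mathscr{C}' \in \mathscr{F}} \max_{(R_1, R_2) \in \mathscr{C}'} c_1 R_1 + c_2 R_2. \tag{70}$$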

For convenience, we use the following two optimization problems

$$\sigma_{\mathscr{C}'}(\mu, 1) = \max_{(R_1, R_2) \in \mathscr{C}'} \mu R_1 + R_2, \tag{71}$$
$$\sigma_{\mathscr{C}'}(1, \mu) = \max_{(R_1, R_2) \in \mathscr{C}'} R_1 + \mu R_2, \tag{72}$$

where $1 \le \mu$. It is easy to show that the solutions of (71) and (72) correspond to the boundary points of the capacity region. In the rest of this section, we introduce classes of admissible channels and obtain upper bounds on $\sigma_{\mathscr{C}'}(\mu, 1)$ and $\sigma_{\mathscr{C}'}(1, \mu)$.

A. Classes of Admissible Channels 

1) Class A1: This class is designed to obtain an upper bound on $\sigma_{\mathscr{C}}(\mu, 1)$. Therefore, we need to find a tight upper bound on $\sigma_{\mathscr{C}'}(\mu, 1)$. A member of this class is a channel in which User 1 has one transmit and one receive antenna whereas User 2 has one transmit antenna and two receive antennas (see Figure 4). The channel model can be written as

$$\begin{aligned} \tilde{y}_1 &= x_1 + \sqrt{a}\, x_2 + z_1, \\ \tilde{y}_{21} &= x_2 + \sqrt{b'}\, x_1 + z_{21}, \\ \tilde{y}_{22} &= x_2 + z_{22}, \end{aligned} \tag{73}$$

where $\tilde{y}_1$ is the signal at the first receiver, $\tilde{y}_{21}$ and $\tilde{y}_{22}$ are the signals at the second receiver, $z_1$ is additive Gaussian noise with unit variance, $z_{21}$ and $z_{22}$ are additive Gaussian noise with variances $N_{21}$ and $N_{22}$, respectively. Transmitters 1 and 2 are subject to the power constraints of $P_1$ and $P_2$, respectively.

To investigate admissibility conditions in (64) and (65), we introduce two deterministic functions $f_1$ and $f_2$ as follows (see Figure 4):

$$f_1(\tilde{y}_1^n) = \tilde{y}_1^n, \tag{74}$$
$$f_2(\tilde{y}_{22}^n, \tilde{y}_{21}^n) = (1 - \sqrt{g_2})\, \tilde{y}_{22}^n + \sqrt{g_2}\, \tilde{y}_{21}^n, \tag{75}$$

where $0 \le g_2$. For $g_2 = 0$, the channel can be converted to the one-sided Gaussian IC by letting $N_{21} \to \infty$ and $N_{22} = 1$. Hence, Class A1 contains the one-sided Gaussian IC obtained by removing the link between Transmitter 1 and Receiver 2. Using $f_1$ and $f_2$, we obtain

$$\hat{y}_1^n = x_1^n + \sqrt{a}\, x_2^n + z_1^n, \tag{76}$$
$$\hat{y}_2^n = \sqrt{b' g_2}\, x_1^n + x_2^n + (1 - \sqrt{g_2})\, z_{22}^n + \sqrt{g_2}\, z_{21}^n. \tag{77}$$
Hence, this channel is admissible if the corresponding parameters satisfy

$$b' g_2 = b, \qquad (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} = 1. \tag{78}$$

We further add the following constraints to the conditions of the channels in Class A1:

$$b' \le N_{21}, \qquad a N_{22} \le 1. \tag{79}$$

Although these additional conditions reduce the number of admissible channels within the class, they are needed to get a closed form formula for an upper bound on $\sigma_{\mathscr{C}'}(\mu, 1)$. In the following lemma, we obtain the required upper bound.

Lemma 3: For the channels modeled by (73) and satisfying (79), we have

$$\begin{aligned} \sigma_{\mathscr{C}'}(\mu, 1) \le \min\ & \frac{\mu_1}{2} \log\left[2\pi e (P_1 + aP_2 + 1)\right] - \frac{\mu_2}{2} \log(2\pi e) + \frac{1}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right) \\ & + \mu_2 f_h\left(P_1, 1, N_{21}, b', \frac{1}{\mu_2}\right) + f_h(P_2, N_{22}, 1, a, \mu_1) \end{aligned} \tag{80}$$

subject to: $\mu_1 + \mu_2 = \mu$, $\mu_1, \mu_2 \ge 0$.
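Proof: Let us assume $R_1$ and $R_2$ are achievable rates for User 1 and 2, respectively. Furthermore, we split $\mu$ into $\mu_1 \ge 0$ and $\mu_2 \ge 0$ such that $\mu = \mu_1 + \mu_2$. Using Fano's inequality, we obtain

$$\begin{aligned} n(\mu R_1 + R_2) &\le \mu I(x_1^n; \tilde{y}_1^n) + I(x_2^n; \tilde{y}_{22}^n, \tilde{y}_{21}^n) + n\epsilon_n \\ &= \mu_1 I(x_1^n; \tilde{y}_1^n) + \mu_2 I(x_1^n; \tilde{y}_1^n) + I(x_2^n; \tilde{y}_{22}^n, \tilde{y}_{21}^n) + n\epsilon_n \\ &\overset{(a)}{\le} \mu_1 I(x_1^n; \tilde{y}_1^n) + \mu_2 I(x_1^n; \tilde{y}_1^n | x_2^n) + I(x_2^n; \tilde{y}_{22}^n, \tilde{y}_{21}^n) + n\epsilon_n \\ &= \mu_1 I(x_1^n; \tilde{y}_1^n) + \mu_2 I(x_1^n; \tilde{y}_1^n | x_2^n) + I(x_2^n; \tilde{y}_{21}^n | \tilde{y}_{22}^n) + I(x_2^n; \tilde{y}_{22}^n) + n\epsilon_n \\ &= \mu_1 h(\tilde{y}_1^n) - \mu_1 h(\tilde{y}_1^n | x_1^n) + \mu_2 h(\tilde{y}_1^n | x_2^n) - \mu_2 h(\tilde{y}_1^n | x_1^n, x_2^n) \\ &\quad + h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n) + h(\tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n) + n\epsilon_n \\ &= \left[\mu_1 h(\tilde{y}_1^n) - \mu_2 h(\tilde{y}_1^n | x_1^n, x_2^n)\right] + \left[\mu_2 h(\tilde{y}_1^n | x_2^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n)\right] \\ &\quad + \left[h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n)\right] + \left[h(\tilde{y}_{22}^n) - \mu_1 h(\tilde{y}_1^n | x_1^n)\right] + n\epsilon_n, \end{aligned} \tag{81}$$

where (a) follows from the fact that $x_1^n$ and $x_2^n$ are independent. Now, we separately upper bound the terms within each bracket in (81).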

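To maximize the terms within the first bracket, we use the fact that Gaussian distribution maximizes the differential entropy subject to a constraint on the covariance matrix. Hence, we have

$$\begin{aligned} \mu_1 h(\tilde{y}_1^n) - \mu_2 h(\tilde{y}_1^n | x_1^n, x_2^n) &= \mu_1 h(x_1^n + \sqrt{a}\, x_2^n + z_1^n) - \mu_2 h(z_1^n) \\ &\le \frac{\mu_1 n}{2} \log\left[2\pi e (P_1 + aP_2 + 1)\right] - \frac{\mu_2 n}{2} \log(2\pi e). \end{aligned} \tag{82}$$

Since $b' \le N_{21}$, we can make use of Lemma 1 to upper bound the second bracket. In this case, we have

$$\begin{aligned} \mu_2 h(\tilde{y}_1^n | x_2^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n) &= \mu_2 \left( h(x_1^n + z_1^n) - \frac{1}{\mu_2} h(\sqrt{b'}\, x_1^n + z_{21}^n) \right) \\ &\le \mu_2 n f_h\left(P_1, 1, N_{21}, b', \frac{1}{\mu_2}\right), \end{aligned} \tag{83}$$

where $f_h$ is defined in (61).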

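We upper bound the terms within the third bracket as follows [13]:

$$\begin{aligned} h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n) &\overset{(a)}{\le} \sum_{i=1}^{n} h(\tilde{y}_{21}[i] \,|\, \tilde{y}_{22}[i]) - h(z_{22}^n) \\ &\overset{(b)}{\le} \sum_{i=1}^{n} \frac{1}{2} \log\left[2\pi e \left(N_{21} + b' P_1[i] + \frac{P_2[i] N_{22}}{P_2[i] + N_{22}}\right)\right] - \frac{n}{2} \log(2\pi e N_{22}) \\ &\overset{(c)}{\le} \frac{n}{2} \log\left[2\pi e \left(N_{21} + \frac{b'}{n} \sum_{i=1}^{n} P_1[i] + \frac{\frac{1}{n} \sum_{i=1}^{n} P_2[i]\, N_{22}}{\frac{1}{n} \sum_{i=1}^{n} P_2[i] + N_{22}}\right)\right] - \frac{n}{2} \log(2\pi e N_{22}) \\ &\le \frac{n}{2} \log\left[2\pi e \left(N_{21} + b' P_1 + \frac{P_2 N_{22}}{P_2 + N_{22}}\right)\right] - \frac{n}{2} \log(2\pi e N_{22}) \\ &= \frac{n}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right), \end{aligned} \tag{84}$$

where (a) follows from the chain rule and the fact that removing conditioning does not decrease differential entropy, (b) follows from the fact that Gaussian distribution maximizes the conditional entropy for a given covariance matrix, and (c) follows from Jensen's inequality.

Fig. 5. Class A2 admissible channels.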
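Step (b) of the third-bracket bound rests on the conditional variance of jointly Gaussian scalars. The following is a minimal Python sketch, not part of the original derivation: the function names and the sample values are illustrative assumptions, and logarithms are taken base 2. It checks numerically that the single-letter Gaussian expression collapses to the closed form on the last line of (84).

```python
import math

def cond_var_y21_given_y22(P1, P2, b_prime, N21, N22):
    """Variance of y21 = x2 + sqrt(b')*x1 + z21 given y22 = x2 + z22,
    for independent zero-mean Gaussian x1, x2, z21, z22."""
    var_y21 = P2 + b_prime * P1 + N21
    var_y22 = P2 + N22
    cov = P2  # only x2 is common to both outputs
    return var_y21 - cov ** 2 / var_y22

def third_bracket_per_symbol(P1, P2, b_prime, N21, N22):
    """h(y21 | y22) - h(z22) per channel use under Gaussian inputs."""
    cv = cond_var_y21_given_y22(P1, P2, b_prime, N21, N22)
    two_pi_e = 2 * math.pi * math.e
    return 0.5 * math.log2(two_pi_e * cv) - 0.5 * math.log2(two_pi_e * N22)

def closed_form_84(P1, P2, b_prime, N21, N22):
    """Last line of (84), divided by n."""
    return 0.5 * math.log2(N21 / N22 + b_prime * P1 / N22 + P2 / (P2 + N22))
```

For any positive parameter values the two functions should agree up to floating-point error, which is a quick sanity check on the algebra in the final equality.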

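For the last bracket, we again make use of the definition of $f_h$. In fact, since $a N_{22} \le 1$, we have

$$h(\tilde{y}_{22}^n) - \mu_1 h(\tilde{y}_1^n | x_1^n) = h(x_2^n + z_{22}^n) - \mu_1 h(\sqrt{a}\, x_2^n + z_1^n) \le n f_h(P_2, N_{22}, 1, a, \mu_1). \tag{85}$$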



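Adding all inequalities, we obtain

$$\begin{aligned} \mu R_1 + R_2 \le\ & \frac{\mu_1}{2} \log\left[2\pi e (P_1 + aP_2 + 1)\right] - \frac{\mu_2}{2} \log(2\pi e) + \frac{1}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right) \\ & + \mu_2 f_h\left(P_1, 1, N_{21}, b', \frac{1}{\mu_2}\right) + f_h(P_2, N_{22}, 1, a, \mu_1), \end{aligned} \tag{86}$$

where the fact that $\epsilon_n \to 0$ as $n \to \infty$ is used to eliminate $\epsilon_n$ from the right hand side of the inequality. Now, by taking the minimum of the right hand side of (86) over all $\mu_1$ and $\mu_2$, we obtain the desired result. This completes the proof. □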
2) Class A2: This class is the complement of Class A1 in the sense that we use it to upper bound $\sigma_{\mathscr{C}}(1, \mu)$. A member of this class is a channel in which User 1 is equipped with one transmit and two receive antennas, whereas User 2 is equipped with one antenna at both transmitter and receiver sides (see Figure 5). The channel model can be written as

$$\begin{aligned} \tilde{y}_{11} &= x_1 + z_{11}, \\ \tilde{y}_{12} &= x_1 + \sqrt{a'}\, x_2 + z_{12}, \\ \tilde{y}_2 &= x_2 + \sqrt{b}\, x_1 + z_2, \end{aligned} \tag{87}$$

where $\tilde{y}_{11}$ and $\tilde{y}_{12}$ are the signals at the first receiver, $\tilde{y}_2$ is the signal at the second receiver, $z_2$ is additive Gaussian noise with unit variance, $z_{11}$ and $z_{12}$ are additive Gaussian noise with variances $N_{11}$ and $N_{12}$, respectively. Transmitters 1 and 2 are subject to the power constraints $P_1$ and $P_2$, respectively.

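For this class, we consider two linear functions $f_1$ and $f_2$ as follows (see Figure 5):

$$f_1(\tilde{y}_{11}^n, \tilde{y}_{12}^n) = (1 - \sqrt{g_1})\, \tilde{y}_{11}^n + \sqrt{g_1}\, \tilde{y}_{12}^n, \tag{88}$$
$$f_2(\tilde{y}_2^n) = \tilde{y}_2^n, \tag{89}$$

where $0 \le g_1$. Similar to Class A1, when $g_1 = 0$, the admissible channels in Class A2 become the one-sided Gaussian IC by letting $N_{12} \to \infty$ and $N_{11} = 1$. Using $f_1$ and $f_2$, we have

$$\hat{y}_1^n = x_1^n + \sqrt{a' g_1}\, x_2^n + (1 - \sqrt{g_1})\, z_{11}^n + \sqrt{g_1}\, z_{12}^n, \tag{90}$$
$$\hat{y}_2^n = \sqrt{b}\, x_1^n + x_2^n + z_2^n. \tag{91}$$

We conclude that the channel modeled by (87) is admissible if the corresponding parameters satisfy

$$a' g_1 = a, \qquad (1 - \sqrt{g_1})^2 N_{11} + g_1 N_{12} = 1. \tag{92}$$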
Fig. 6. Class B admissible channels.



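Similar to Class A1, we further add the following constraints to the conditions of Class A2 channels:

$$a' \le N_{12}, \qquad b N_{11} \le 1. \tag{93}$$

In the following lemma, we obtain the required upper bound.

Lemma 4: For the channels modeled by (87) and satisfying (93), we have

$$\begin{aligned} \sigma_{\mathscr{C}'}(1, \mu) \le \min\ & \frac{\mu_1}{2} \log\left[2\pi e (bP_1 + P_2 + 1)\right] - \frac{\mu_2}{2} \log(2\pi e) + \frac{1}{2} \log\left(\frac{N_{12}}{N_{11}} + \frac{a' P_2}{N_{11}} + \frac{P_1}{P_1 + N_{11}}\right) \\ & + \mu_2 f_h\left(P_2, 1, N_{12}, a', \frac{1}{\mu_2}\right) + f_h(P_1, N_{11}, 1, b, \mu_1) \end{aligned} \tag{94}$$

subject to: $\mu_1 + \mu_2 = \mu$, $\mu_1, \mu_2 \ge 0$.
Proof: The proof is similar to that of Lemma 3 and we omit it here. □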
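3) Class B: A member of this class is a channel with one transmit antenna and two receive antennas for each user modeled by (see Figure 6)

$$\begin{aligned} \tilde{y}_{11} &= x_1 + z_{11}, \\ \tilde{y}_{12} &= x_1 + \sqrt{a'}\, x_2 + z_{12}, \\ \tilde{y}_{21} &= x_2 + \sqrt{b'}\, x_1 + z_{21}, \\ \tilde{y}_{22} &= x_2 + z_{22}, \end{aligned} \tag{95}$$

where $\tilde{y}_{11}$ and $\tilde{y}_{12}$ are the signals at the first receiver, $\tilde{y}_{21}$ and $\tilde{y}_{22}$ are the signals at the second receiver, and $z_{ij}$ is additive Gaussian noise with variance $N_{ij}$ for $i, j \in \{1, 2\}$. Transmitters 1 and 2 are subject to the power constraints $P_1$ and $P_2$, respectively. In fact, this channel is designed to upper bound both $\sigma_{\mathscr{C}}(\mu, 1)$ and $\sigma_{\mathscr{C}}(1, \mu)$.

Next, we investigate admissibility of this channel and the conditions that must be imposed on the underlying parameters. Let us consider two linear deterministic functions $f_1$ and $f_2$ with parameters $0 \le g_1$ and $0 \le g_2$, respectively, as follows (see Figure 6):

$$f_1(\tilde{y}_{11}^n, \tilde{y}_{12}^n) = (1 - \sqrt{g_1})\, \tilde{y}_{11}^n + \sqrt{g_1}\, \tilde{y}_{12}^n, \tag{96}$$
$$f_2(\tilde{y}_{22}^n, \tilde{y}_{21}^n) = (1 - \sqrt{g_2})\, \tilde{y}_{22}^n + \sqrt{g_2}\, \tilde{y}_{21}^n. \tag{97}$$

Therefore, we have

$$\hat{y}_1^n = x_1^n + \sqrt{a' g_1}\, x_2^n + (1 - \sqrt{g_1})\, z_{11}^n + \sqrt{g_1}\, z_{12}^n, \tag{98}$$
$$\hat{y}_2^n = \sqrt{b' g_2}\, x_1^n + x_2^n + (1 - \sqrt{g_2})\, z_{22}^n + \sqrt{g_2}\, z_{21}^n. \tag{99}$$

To satisfy (64) and (65), it suffices to have

$$\begin{aligned} a' g_1 &= a, \\ b' g_2 &= b, \\ (1 - \sqrt{g_1})^2 N_{11} + g_1 N_{12} &= 1, \\ (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} &= 1. \end{aligned} \tag{100}$$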
Hence, a channel modeled by (95) is admissible if there exist two nonnegative numbers $g_1$ and $g_2$ such that the equalities in (100) are satisfied. We further add the following two constraints to the equality conditions in (100):

$$b' N_{11} \le N_{21}, \qquad a' N_{22} \le N_{12}. \tag{101}$$



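Although adding more constraints reduces the number of the admissible channels, it enables us to compute an outer bound on $\sigma_{\mathscr{C}'}(\mu, 1)$ and $\sigma_{\mathscr{C}'}(1, \mu)$.

Lemma 5: For the channels modeled by (95) and satisfying (101), we have

$$\begin{aligned} \sigma_{\mathscr{C}'}(\mu, 1) \le\ & \mu \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \\ & + f_h(P_2, N_{22}, N_{12}, a', \mu) + \frac{\mu}{2} \log\left((2\pi e)(a' P_2 + N_{12})\right) - \frac{1}{2} \log\left((2\pi e)(P_2 + N_{22})\right), \end{aligned} \tag{102}$$

$$\begin{aligned} \sigma_{\mathscr{C}'}(1, \mu) \le\ & \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \mu \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \\ & + f_h(P_1, N_{11}, N_{21}, b', \mu) + \frac{\mu}{2} \log\left((2\pi e)(b' P_1 + N_{21})\right) - \frac{1}{2} \log\left((2\pi e)(P_1 + N_{11})\right). \end{aligned} \tag{103}$$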
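Proof: We only upper bound $\sigma_{\mathscr{C}'}(\mu, 1)$; an upper bound on $\sigma_{\mathscr{C}'}(1, \mu)$ can be similarly obtained. Let us assume $R_1$ and $R_2$ are achievable rates for User 1 and User 2, respectively. Using Fano's inequality, we obtain

$$\begin{aligned} n(\mu R_1 + R_2) &\le \mu I(x_1^n; \tilde{y}_{11}^n, \tilde{y}_{12}^n) + I(x_2^n; \tilde{y}_{22}^n, \tilde{y}_{21}^n) + n\epsilon_n \\ &= \mu I(x_1^n; \tilde{y}_{12}^n | \tilde{y}_{11}^n) + \mu I(x_1^n; \tilde{y}_{11}^n) + I(x_2^n; \tilde{y}_{21}^n | \tilde{y}_{22}^n) + I(x_2^n; \tilde{y}_{22}^n) + n\epsilon_n \\ &= \mu h(\tilde{y}_{12}^n | \tilde{y}_{11}^n) - \mu h(\tilde{y}_{12}^n | x_1^n, \tilde{y}_{11}^n) + \mu h(\tilde{y}_{11}^n) - \mu h(\tilde{y}_{11}^n | x_1^n) \\ &\quad + h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n) + h(\tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n) + n\epsilon_n \\ &= \left[\mu h(\tilde{y}_{12}^n | \tilde{y}_{11}^n) - \mu h(\tilde{y}_{11}^n | x_1^n)\right] + \left[h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n)\right] \\ &\quad + \left[\mu h(\tilde{y}_{11}^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n)\right] + \left[h(\tilde{y}_{22}^n) - \mu h(\tilde{y}_{12}^n | x_1^n, \tilde{y}_{11}^n)\right] + n\epsilon_n. \end{aligned} \tag{104}$$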
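Next, we upper bound the terms within each bracket in (104) separately. For the first bracket, we have

$$\begin{aligned} \mu h(\tilde{y}_{12}^n | \tilde{y}_{11}^n) - \mu h(\tilde{y}_{11}^n | x_1^n) &\overset{(a)}{\le} \mu \sum_{i=1}^{n} h(\tilde{y}_{12}[i] \,|\, \tilde{y}_{11}[i]) - \frac{\mu n}{2} \log(2\pi e N_{11}) \\ &\overset{(b)}{\le} \mu \sum_{i=1}^{n} \frac{1}{2} \log\left[2\pi e \left(N_{12} + a' P_2[i] + \frac{P_1[i] N_{11}}{P_1[i] + N_{11}}\right)\right] - \frac{\mu n}{2} \log(2\pi e N_{11}) \\ &\overset{(c)}{\le} \frac{\mu n}{2} \log\left[2\pi e \left(N_{12} + \frac{a'}{n} \sum_{i=1}^{n} P_2[i] + \frac{\frac{1}{n} \sum_{i=1}^{n} P_1[i]\, N_{11}}{\frac{1}{n} \sum_{i=1}^{n} P_1[i] + N_{11}}\right)\right] - \frac{\mu n}{2} \log(2\pi e N_{11}) \\ &\le \frac{\mu n}{2} \log\left[2\pi e \left(N_{12} + a' P_2 + \frac{P_1 N_{11}}{P_1 + N_{11}}\right)\right] - \frac{\mu n}{2} \log(2\pi e N_{11}) \\ &= \frac{\mu n}{2} \log\left(\frac{N_{12}}{N_{11}} + \frac{a' P_2}{N_{11}} + \frac{P_1}{P_1 + N_{11}}\right), \end{aligned} \tag{105}$$

where (a) follows from the chain rule and the fact that removing conditioning does not decrease differential entropy, (b) follows from the fact that Gaussian distribution maximizes conditional entropy for a given covariance matrix, and (c) follows from Jensen's inequality.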

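Similarly, the terms within the second bracket can be upper bounded as

$$h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - h(\tilde{y}_{22}^n | x_2^n) \le \frac{n}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right). \tag{106}$$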



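Using Lemma 1 and the fact that $N_{11} \le N_{21}/b'$, the terms within the third bracket can be upper bounded as

$$\begin{aligned} \mu h(\tilde{y}_{11}^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n) &= \mu \left( h(x_1^n + z_{11}^n) - \frac{1}{\mu} h(\sqrt{b'}\, x_1^n + z_{21}^n) \right) \\ &\le \mu n f_h\left(P_1, N_{11}, N_{21}, b', \frac{1}{\mu}\right). \end{aligned} \tag{107}$$

Since $1 \le \mu$, from (61) we obtain

$$\mu h(\tilde{y}_{11}^n) - h(\tilde{y}_{21}^n | x_2^n, \tilde{y}_{22}^n) \le \frac{\mu n}{2} \log\left((2\pi e)(P_1 + N_{11})\right) - \frac{n}{2} \log\left((2\pi e)(b' P_1 + N_{21})\right). \tag{108}$$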
For the last bracket, again we use Lemma 1 to obtain

$$h(\tilde{y}_{22}^n) - \mu h(\tilde{y}_{12}^n | x_1^n, \tilde{y}_{11}^n) = h(x_2^n + z_{22}^n) - \mu h(\sqrt{a'}\, x_2^n + z_{12}^n) \le n f_h(P_2, N_{22}, N_{12}, a', \mu). \tag{109}$$

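Adding all inequalities, we have

$$\begin{aligned} \mu R_1 + R_2 \le\ & \frac{\mu}{2} \log\left(\frac{N_{12}}{N_{11}} + \frac{a' P_2}{N_{11}} + \frac{P_1}{P_1 + N_{11}}\right) + \frac{1}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right) \\ & + \frac{\mu}{2} \log\left((2\pi e)(P_1 + N_{11})\right) - \frac{1}{2} \log\left((2\pi e)(b' P_1 + N_{21})\right) + f_h(P_2, N_{22}, N_{12}, a', \mu), \end{aligned} \tag{110}$$

where the fact that $\epsilon_n \to 0$ as $n \to \infty$ is used to eliminate $\epsilon_n$ from the right hand side of the inequality. By rearranging the terms, we obtain

$$\begin{aligned} \mu R_1 + R_2 \le\ & \mu \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \\ & + f_h(P_2, N_{22}, N_{12}, a', \mu) + \frac{\mu}{2} \log\left((2\pi e)(a' P_2 + N_{12})\right) - \frac{1}{2} \log\left((2\pi e)(P_2 + N_{22})\right). \end{aligned}$$

This completes the proof. □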
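The rearrangement from (110) to the form stated in Lemma 5 is pure algebra on the logarithmic terms; the $f_h$ term is common to both sides and can be left out of a check. A minimal Python sketch of such a check follows. It assumes $\gamma(x) = \frac{1}{2}\log(1 + x)$ with base-2 logarithms, and the function names and sample parameter values are illustrative only.

```python
import math

TWO_PI_E = 2 * math.pi * math.e

def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)

def rhs_110_without_fh(mu, P1, P2, a_p, b_p, N11, N12, N21, N22):
    """Right hand side of (110), omitting the common f_h term."""
    return (mu / 2 * math.log2(N12 / N11 + a_p * P2 / N11 + P1 / (P1 + N11))
            + 0.5 * math.log2(N21 / N22 + b_p * P1 / N22 + P2 / (P2 + N22))
            + mu / 2 * math.log2(TWO_PI_E * (P1 + N11))
            - 0.5 * math.log2(TWO_PI_E * (b_p * P1 + N21)))

def rhs_102_without_fh(mu, P1, P2, a_p, b_p, N11, N12, N21, N22):
    """Rearranged form (the bound in Lemma 5), omitting the common f_h term."""
    return (mu * gamma(P1 / N11 + P1 / (a_p * P2 + N12))
            + gamma(P2 / N22 + P2 / (b_p * P1 + N21))
            + mu / 2 * math.log2(TWO_PI_E * (a_p * P2 + N12))
            - 0.5 * math.log2(TWO_PI_E * (P2 + N22)))
```

Evaluating both functions at the same positive arguments should give equal values up to rounding, for any $\mu$.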

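A unique feature of the channels within Class B is that for $1 \le \mu \le \frac{P_2 + N_{12}/a'}{P_2 + N_{22}}$ and $1 \le \mu \le \frac{P_1 + N_{21}/b'}{P_1 + N_{11}}$, the upper bounds in (102) and (103) become, respectively,

$$\mu R_1 + R_2 \le \mu \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \tag{111}$$

and

$$R_1 + \mu R_2 \le \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \mu \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right). \tag{112}$$

On the other hand, if the receivers treat the interference as noise, it can be shown that

$$R_1 = \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) \tag{113}$$

and

$$R_2 = \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \tag{114}$$

are achievable. Comparing upper bounds and achievable rates, we conclude that the upper bounds are indeed tight. In fact, this property was first observed by Etkin et al. in [13]. We summarize this result in the following theorem:

Theorem 4: The sum capacity in Class B is attained when transmitters use Gaussian codebooks and receivers treat the interference as noise. In this case, the sum capacity is

$$\mathscr{C}'_{sum} = \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right). \tag{115}$$

Proof: By substituting $\mu = 1$ in (111), we obtain the desired result. □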
4) Class C: Class C is designed to upper bound $\sigma_{\mathscr{C}}(\mu, 1)$ for the mixed Gaussian IC where $1 \le b$. Class C is similar to Class A1 (see Figure 4); however, we impose different constraints on the parameters of the channels within Class C. These constraints assist us in providing upper bounds by using the fact that at one of the receivers both signals are decodable.

For channels in Class C, we use the same model that is given in (73). Therefore, similar to channels in Class A1, this channel is admissible if the corresponding parameters satisfy

$$b' g_2 = b, \qquad (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} = 1. \tag{116}$$

Next, we change the constraints in (79) as

$$b' \ge N_{21}, \qquad a N_{22} \le 1. \tag{117}$$

Through this change of constraints, the second receiver after decoding its own signal will have a less noisy version of the first user's signal, and consequently, it is able to decode the signal of the first user as well as its own signal. Relying on this observation, we have the following lemma.
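Lemma 6: For a channel in Class C, we have

$$\begin{aligned} \sigma_{\mathscr{C}'}(\mu, 1) \le\ & \frac{\mu - 1}{2} \log\left(2\pi e (P_1 + aP_2 + 1)\right) + \frac{1}{2} \log\left(2\pi e \left(\frac{P_2 N_{22}}{P_2 + N_{22}} + b' P_1 + N_{21}\right)\right) \\ & - \frac{1}{2} \log(2\pi e N_{21}) - \frac{1}{2} \log(2\pi e N_{22}) + f_h(P_2, N_{22}, 1, a, \mu - 1). \end{aligned} \tag{118}$$

Proof: Since the second user is able to decode both users' messages, we have

$$R_1 \le \frac{1}{n} I(x_1^n; \tilde{y}_1^n), \tag{119}$$
$$R_1 \le \frac{1}{n} I(x_1^n; \tilde{y}_{21}^n, \tilde{y}_{22}^n | x_2^n), \tag{120}$$
$$R_2 \le \frac{1}{n} I(x_2^n; \tilde{y}_{21}^n, \tilde{y}_{22}^n | x_1^n), \tag{121}$$
$$R_1 + R_2 \le \frac{1}{n} I(x_1^n, x_2^n; \tilde{y}_{21}^n, \tilde{y}_{22}^n). \tag{122}$$

From $a N_{22} \le 1$, we have $I(x_1^n; \tilde{y}_1^n) \le I(x_1^n; \tilde{y}_{21}^n | x_2^n) = I(x_1^n; \tilde{y}_{21}^n, \tilde{y}_{22}^n | x_2^n)$. Hence, (120) is redundant. It can be shown that

$$\mu R_1 + R_2 \le \frac{\mu - 1}{n} I(x_1^n; \tilde{y}_1^n) + \frac{1}{n} I(x_1^n, x_2^n; \tilde{y}_{21}^n, \tilde{y}_{22}^n). \tag{123}$$

Hence, we have

$$\begin{aligned} \mu R_1 + R_2 &\le \frac{\mu - 1}{n} h(\tilde{y}_1^n) - \frac{\mu - 1}{n} h(\tilde{y}_1^n | x_1^n) + \frac{1}{n} h(\tilde{y}_{21}^n, \tilde{y}_{22}^n) - \frac{1}{n} h(\tilde{y}_{21}^n, \tilde{y}_{22}^n | x_1^n, x_2^n) \\ &= \frac{\mu - 1}{n} h(\tilde{y}_1^n) + \frac{1}{n} h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) - \frac{1}{n} h(\tilde{y}_{21}^n, \tilde{y}_{22}^n | x_1^n, x_2^n) \\ &\quad + \left[\frac{1}{n} h(\tilde{y}_{22}^n) - \frac{\mu - 1}{n} h(\tilde{y}_1^n | x_1^n)\right]. \end{aligned} \tag{124}$$

Next, we bound the different terms in (124). For the first term, we have

$$\frac{\mu - 1}{n} h(\tilde{y}_1^n) \le \frac{\mu - 1}{2} \log\left(2\pi e (P_1 + aP_2 + 1)\right). \tag{125}$$

The second term can be bounded as

$$\frac{1}{n} h(\tilde{y}_{21}^n | \tilde{y}_{22}^n) \le \frac{1}{2} \log\left(2\pi e \left(\frac{P_2 N_{22}}{P_2 + N_{22}} + b' P_1 + N_{21}\right)\right). \tag{126}$$

The third term can be written as

$$\frac{1}{n} h(\tilde{y}_{21}^n, \tilde{y}_{22}^n | x_1^n, x_2^n) = \frac{1}{2} \log(2\pi e N_{21}) + \frac{1}{2} \log(2\pi e N_{22}). \tag{127}$$

The last terms can be bounded as

$$\begin{aligned} \frac{1}{n} h(\tilde{y}_{22}^n) - \frac{\mu - 1}{n} h(\tilde{y}_1^n | x_1^n) &= \frac{1}{n} h(x_2^n + z_{22}^n) - \frac{\mu - 1}{n} h(\sqrt{a}\, x_2^n + z_1^n) && (128) \\ &\le f_h(P_2, N_{22}, 1, a, \mu - 1). && (129) \end{aligned}$$

Adding all inequalities, we obtain the desired result. □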

IV. Weak Gaussian Interference Channel 

In this section, we focus on the weak Gaussian IC. We first obtain the sum capacity of this channel for a certain range of 
parameters. Then, we obtain an outer bound on the capacity region which is tighter than the previously known outer bounds. 
Finally, we show that time-sharing and concavification result in the same achievable region for Gaussian codebooks. 
A. Sum Capacity

In this subsection, we use the Class B channels to obtain the sum capacity of the weak IC for a certain range of parameters. To this end, let us consider the following minimization problem:

$$W = \min\ \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \tag{130}$$

subject to:

$$\begin{aligned} & a' g_1 = a \\ & b' g_2 = b \\ & b' N_{11} \le N_{21} \\ & a' N_{22} \le N_{12} \\ & (1 - \sqrt{g_1})^2 N_{11} + g_1 N_{12} = 1 \\ & (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} = 1 \\ & 0 \le [a', b', g_1, g_2, N_{11}, N_{12}, N_{22}, N_{21}]. \end{aligned}$$

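The objective function in (130) is the sum capacity of Class B channels obtained in Theorem 4. The constraints are the combination of (100) and (101), which are applied to confirm the admissibility of the channel and to validate the sum capacity result. Since every channel in the class is admissible, we have $\mathscr{C}_{sum} \le W$. Substituting $S_1 = g_1 N_{12}$ and $S_2 = g_2 N_{21}$, we have

$$W = \min\ \gamma\left(\frac{(1 - \sqrt{g_1})^2 P_1}{1 - S_1} + \frac{g_1 P_1}{a P_2 + S_1}\right) + \gamma\left(\frac{(1 - \sqrt{g_2})^2 P_2}{1 - S_2} + \frac{g_2 P_2}{b P_1 + S_2}\right) \tag{131}$$

subject to:

$$\begin{aligned} & \frac{b(1 - S_1)}{(1 - \sqrt{g_1})^2} \le S_2 < 1 \\ & \frac{a(1 - S_2)}{(1 - \sqrt{g_2})^2} \le S_1 < 1 \\ & 0 < [g_1, g_2]. \end{aligned}$$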
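The change of variables leading to (131) can be checked numerically. The sketch below is an illustration rather than part of the original derivation: it maps $(g_1, g_2, S_1, S_2)$ back to the Class B parameters using the equalities in (100), then compares the objectives of (130) and (131). The function names, the dictionary keys, and the base-2 logarithm in `gamma` are assumptions made for this example, and it requires $0 < g_i \ne 1$ so that the divisions are defined.

```python
import math

def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)

def class_b_parameters(a, b, g1, g2, S1, S2):
    """Recover (a', b', N11, N12, N22, N21) from S1 = g1*N12, S2 = g2*N21
    and the admissibility equalities (100)."""
    return {
        "a_p": a / g1,
        "b_p": b / g2,
        "N11": (1 - S1) / (1 - math.sqrt(g1)) ** 2,
        "N12": S1 / g1,
        "N22": (1 - S2) / (1 - math.sqrt(g2)) ** 2,
        "N21": S2 / g2,
    }

def objective_130(P1, P2, p):
    """Objective of (130) in the original parameters."""
    return (gamma(P1 / p["N11"] + P1 / (p["a_p"] * P2 + p["N12"]))
            + gamma(P2 / p["N22"] + P2 / (p["b_p"] * P1 + p["N21"])))

def objective_131(P1, P2, a, b, g1, g2, S1, S2):
    """Objective of (131) in the substituted variables."""
    t1, t2 = (1 - math.sqrt(g1)) ** 2, (1 - math.sqrt(g2)) ** 2
    return (gamma(t1 * P1 / (1 - S1) + g1 * P1 / (a * P2 + S1))
            + gamma(t2 * P2 / (1 - S2) + g2 * P2 / (b * P1 + S2)))
```

With this mapping, the constraint $b' N_{11} \le N_{21}$ becomes $b(1 - S_1)/(1 - \sqrt{g_1})^2 \le S_2$ after multiplying both sides by $g_2$, which is the first constraint of (131).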

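By first minimizing with respect to $g_1$ and $g_2$, the optimization problem (131) can be decomposed as

$$W = \min\ W_1 + W_2 \tag{132}$$

subject to: $0 < S_1 < 1$, $0 < S_2 < 1$,

where $W_1$ is defined as

$$W_1 = \min_{g_1}\ \gamma\left(\frac{(1 - \sqrt{g_1})^2 P_1}{1 - S_1} + \frac{g_1 P_1}{a P_2 + S_1}\right) \tag{133}$$

subject to: $\frac{b(1 - S_1)}{S_2} \le (1 - \sqrt{g_1})^2$, $0 < g_1$.

Similarly, $W_2$ is defined as

$$W_2 = \min_{g_2}\ \gamma\left(\frac{(1 - \sqrt{g_2})^2 P_2}{1 - S_2} + \frac{g_2 P_2}{b P_1 + S_2}\right) \tag{134}$$

subject to: $\frac{a(1 - S_2)}{S_1} \le (1 - \sqrt{g_2})^2$, $0 < g_2$.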



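The optimization problems (133) and (134) are easy to solve. In fact, we have

$$W_1 = \begin{cases} \gamma\left(\frac{P_1}{1 + aP_2}\right) & \text{if } \sqrt{b}(1 + aP_2) \le \sqrt{S_2(1 - S_1)} \\ \gamma\left(\frac{bP_1}{S_2} + \frac{\left(1 - \sqrt{b(1 - S_1)/S_2}\right)^2 P_1}{aP_2 + S_1}\right) & \text{otherwise} \end{cases} \tag{135}$$

$$W_2 = \begin{cases} \gamma\left(\frac{P_2}{1 + bP_1}\right) & \text{if } \sqrt{a}(1 + bP_1) \le \sqrt{S_1(1 - S_2)} \\ \gamma\left(\frac{aP_2}{S_1} + \frac{\left(1 - \sqrt{a(1 - S_2)/S_1}\right)^2 P_2}{bP_1 + S_2}\right) & \text{otherwise} \end{cases} \tag{136}$$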
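The first case of (135) can be sanity-checked numerically. In the sketch below, which is an illustration under stated assumptions and not the authors' procedure, the argument of $\gamma$ in (133) is minimized over $g_1$ without the constraint. Setting the derivative with respect to $\sqrt{g_1}$ to zero gives $\sqrt{g_1} = (aP_2 + S_1)/(1 + aP_2)$, and the constraint of (133) is inactive at that point exactly when $\sqrt{b}(1 + aP_2) \le \sqrt{S_2(1 - S_1)}$. Function names and sample values are hypothetical.

```python
import math

def w1_argument(g1, P1, P2, a, S1):
    """Argument of gamma in (133) as a function of g1."""
    t = math.sqrt(g1)
    return (1 - t) ** 2 * P1 / (1 - S1) + g1 * P1 / (a * P2 + S1)

def unconstrained_minimizer(P2, a, S1):
    """g1 at which the derivative with respect to sqrt(g1) vanishes."""
    t = (a * P2 + S1) / (1 + a * P2)
    return t * t

def constraint_inactive(P2, a, b, S1, S2):
    """First-case condition of (135)."""
    return math.sqrt(b) * (1 + a * P2) <= math.sqrt(S2 * (1 - S1))
```

At the unconstrained minimizer the argument equals $P_1/(1 + aP_2)$, which is the signal-to-interference-plus-noise ratio of User 1 when interference is treated as noise.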
From (135) and (136), we observe that for $S_1$ and $S_2$ satisfying $\sqrt{b}(1 + aP_2) \le \sqrt{S_2(1 - S_1)}$ and $\sqrt{a}(1 + bP_1) \le \sqrt{S_1(1 - S_2)}$, the objective function becomes independent of $S_1$ and $S_2$. In this case, we have

$$W = \gamma\left(\frac{P_1}{1 + aP_2}\right) + \gamma\left(\frac{P_2}{1 + bP_1}\right), \tag{137}$$

which is achievable by treating interference as noise. In the following theorem, we prove that it is possible to find a certain range of parameters such that there exist $S_1$ and $S_2$ yielding (137).
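Theorem 5: The sum capacity of the two-user Gaussian IC is

$$\mathscr{C}_{sum} = \gamma\left(\frac{P_1}{1 + aP_2}\right) + \gamma\left(\frac{P_2}{1 + bP_1}\right), \tag{138}$$

for the range of parameters satisfying

$$\sqrt{b}\, P_1 + \sqrt{a}\, P_2 \le \frac{1 - \sqrt{a} - \sqrt{b}}{\sqrt{ab}}. \tag{139}$$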
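Proof: Let us fix $a$ and $b$, and define $D$ as

$$D = \left\{ (P_1, P_2) \,\middle|\, P_1 \le \frac{\sqrt{S_1(1 - S_2)}}{b\sqrt{a}} - \frac{1}{b},\ P_2 \le \frac{\sqrt{S_2(1 - S_1)}}{a\sqrt{b}} - \frac{1}{a},\ 0 < S_1 < 1,\ 0 < S_2 < 1 \right\}. \tag{140}$$

In fact, if $D$ is feasible then there exist $0 < S_1 < 1$ and $0 < S_2 < 1$ satisfying $\sqrt{b}(1 + aP_2) \le \sqrt{S_2(1 - S_1)}$ and $\sqrt{a}(1 + bP_1) \le \sqrt{S_1(1 - S_2)}$. Therefore, the sum capacity of the channel for all feasible points is attained due to (137).

We claim that $D = D'$, where $D'$ is defined as

$$D' = \left\{ (P_1, P_2) \,\middle|\, \sqrt{b}\, P_1 + \sqrt{a}\, P_2 \le \frac{1 - \sqrt{a} - \sqrt{b}}{\sqrt{ab}} \right\}. \tag{141}$$

To show $D' \subseteq D$, we set $S_1 = 1 - S_2$ in (140) to get

$$\left\{ (P_1, P_2) \,\middle|\, P_1 \le \frac{S_1}{b\sqrt{a}} - \frac{1}{b},\ P_2 \le \frac{1 - S_1}{a\sqrt{b}} - \frac{1}{a},\ 0 < S_1 < 1 \right\} \subseteq D. \tag{142}$$

It is easy to show that the left hand side of the above relation is another representation of the region $D'$. Hence, we have $D' \subseteq D$.

To show $D \subseteq D'$, it suffices to prove that for any $(P_1, P_2) \in D$, $\sqrt{b}\, P_1 + \sqrt{a}\, P_2 \le \frac{1 - \sqrt{a} - \sqrt{b}}{\sqrt{ab}}$ holds. To this end, we introduce the following maximization problem:

$$J = \max_{(P_1, P_2) \in D} \sqrt{b}\, P_1 + \sqrt{a}\, P_2, \tag{143}$$

which can be written as

$$J = \max_{(S_1, S_2) \in (0, 1)^2} \frac{\sqrt{S_1(1 - S_2)} + \sqrt{S_2(1 - S_1)}}{\sqrt{ab}} - \frac{1}{\sqrt{a}} - \frac{1}{\sqrt{b}}. \tag{144}$$

It is easy to show that the solution to the above optimization problem is

$$J = \frac{1}{\sqrt{ab}} - \frac{1}{\sqrt{a}} - \frac{1}{\sqrt{b}}. \tag{145}$$

Hence, we deduce that $D \subseteq D'$. This completes the proof. □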
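The step from (144) to (145) uses the fact that $\sqrt{S_1(1 - S_2)} + \sqrt{S_2(1 - S_1)} \le 1$ on the unit square, with equality when $S_1 = 1 - S_2$; this follows from the Cauchy-Schwarz inequality. A minimal Python sketch of a grid check is given below. The helper names and the grid resolution are arbitrary choices for illustration.

```python
import math

def s_term(S1, S2):
    """Numerator term of (144)."""
    return math.sqrt(S1 * (1 - S2)) + math.sqrt(S2 * (1 - S1))

def J_objective(a, b, S1, S2):
    """Objective of (144) at a given (S1, S2)."""
    return s_term(S1, S2) / math.sqrt(a * b) - 1 / math.sqrt(a) - 1 / math.sqrt(b)

def grid_max_s_term(steps=200):
    """Largest value of s_term on an open grid inside (0, 1)^2."""
    pts = [k / steps for k in range(1, steps)]
    return max(s_term(s1, s2) for s1 in pts for s2 in pts)
```

On any grid the maximum should not exceed 1, so the value of (144) cannot exceed $(1 - \sqrt{a} - \sqrt{b})/\sqrt{ab}$, which is the right hand side of (139).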
Remark 3: The above sum capacity result for the weak Gaussian IC (see also [24]) has been established independently in 
[25] and [26]. 

As an example, let us consider the symmetric Gaussian IC. In this case, the constraint in (139) becomes

$$P \le \frac{1 - 2\sqrt{a}}{2a\sqrt{a}}. \tag{146}$$

In Figure 7, the admissible region for $P$, where treating interference as noise is optimal, versus $\sqrt{a}$ is plotted. For a fixed $P$ and all $0 \le a \le 1$, the upper bound in (130) and the lower bound when receivers treat the interference as noise are plotted in Figure 8. We observe that up to a certain value of $a$, the upper bound coincides with the lower bound.
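Conditions (139) and (146) are straightforward to evaluate. The following Python sketch is a hedged illustration: the function names are hypothetical, and rates are expressed in bits per channel use by assuming base-2 logarithms in $\gamma$. It tests whether a parameter set falls in the range covered by Theorem 5 and evaluates the rate in (138).

```python
import math

def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x), in bits per channel use."""
    return 0.5 * math.log2(1.0 + x)

def satisfies_139(P1, P2, a, b):
    """Condition (139): sqrt(b)*P1 + sqrt(a)*P2 <= (1 - sqrt(a) - sqrt(b)) / sqrt(a*b)."""
    lhs = math.sqrt(b) * P1 + math.sqrt(a) * P2
    rhs = (1.0 - math.sqrt(a) - math.sqrt(b)) / math.sqrt(a * b)
    return lhs <= rhs

def treat_interference_as_noise_sum_rate(P1, P2, a, b):
    """Sum rate in (138); equals the sum capacity when (139) holds."""
    return gamma(P1 / (1.0 + a * P2)) + gamma(P2 / (1.0 + b * P1))

def symmetric_power_threshold(a):
    """Right hand side of (146) for the symmetric channel (a = b, P1 = P2 = P)."""
    return (1.0 - 2.0 * math.sqrt(a)) / (2.0 * a * math.sqrt(a))
```

For instance, with $a = b = 0.04$ the threshold in (146) is $0.6 / 0.016 = 37.5$, so a symmetric channel with $P = 37$ satisfies (139) while one with $P = 38$ does not.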



Fig. 7. The shaded area is the region where treating interference as noise is optimal for obtaining the sum capacity of the symmetric Gaussian IC.
B. New Outer Bound

For the weak Gaussian IC, there are two outer bounds that are tighter than the other known bounds. The first one, due to Kramer [12], is obtained by relying on the fact that the capacity region of the Gaussian IC is inside the capacity regions of the two underlying one-sided Gaussian ICs. Even though the capacity region of the one-sided Gaussian IC is unknown, there exists an outer bound for this channel that can be used instead. Kramer's outer bound is the intersection of two regions $E_1$ and $E_2$. $E_1$ is the collection of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left(\frac{(1 - \beta) P'}{\beta P' + 1/a}\right), \tag{147}$$
$$R_2 \le \gamma(\beta P'), \tag{148}$$

for all $\beta \in [0, \beta_{\max}]$, where $P' = P_1/a + P_2$ and $\beta_{\max} = \frac{P_2}{P'(1 + P_1)}$. Similarly, $E_2$ is the collection of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma(\alpha P''), \tag{149}$$
$$R_2 \le \gamma\left(\frac{(1 - \alpha) P''}{\alpha P'' + 1/b}\right), \tag{150}$$

for all $\alpha \in [0, \alpha_{\max}]$, where $P'' = P_1 + P_2/b$ and $\alpha_{\max} = \frac{P_1}{P''(1 + P_2)}$.

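The second outer bound, due to Etkin et al. [13], is obtained by using the genie-aided technique to upper bound different linear combinations of rates that appear in the HK achievable region. Their outer bound is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma(P_1), \tag{151}$$
$$R_2 \le \gamma(P_2), \tag{152}$$
$$R_1 + R_2 \le \gamma(P_1) + \gamma\left(\frac{P_2}{1 + bP_1}\right), \tag{153}$$
$$R_1 + R_2 \le \gamma(P_2) + \gamma\left(\frac{P_1}{1 + aP_2}\right), \tag{154}$$
$$R_1 + R_2 \le \gamma\left(aP_2 + \frac{P_1}{1 + bP_1}\right) + \gamma\left(bP_1 + \frac{P_2}{1 + aP_2}\right), \tag{155}$$
$$2R_1 + R_2 \le \gamma(P_1 + aP_2) + \gamma\left(bP_1 + \frac{P_2}{1 + aP_2}\right) + 0.5 \log\left(\frac{1 + P_1}{1 + bP_1}\right), \tag{156}$$
$$R_1 + 2R_2 \le \gamma(bP_1 + P_2) + \gamma\left(aP_2 + \frac{P_1}{1 + bP_1}\right) + 0.5 \log\left(\frac{1 + P_2}{1 + aP_2}\right). \tag{157}$$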


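In the outer bound proposed here, we derive an upper bound on all linear combinations of the rates. Recall that to obtain the boundary points of the capacity region $\mathscr{C}$, it suffices to calculate $\sigma_{\mathscr{C}}(\mu, 1)$ and $\sigma_{\mathscr{C}}(1, \mu)$ for all $1 \le \mu$. To this end, we make use of channels in A1 and B classes and channels in A2 and B classes to obtain upper bounds on $\sigma_{\mathscr{C}}(\mu, 1)$ and $\sigma_{\mathscr{C}}(1, \mu)$, respectively.

In order to obtain an upper bound on $\sigma_{\mathscr{C}}(\mu, 1)$, we introduce two optimization problems as follows. The first optimization problem is written as

$$\begin{aligned} W_1(\mu) = \min\ & \frac{\mu_1}{2} \log\left[2\pi e (P_1 + aP_2 + 1)\right] - \frac{\mu_2}{2} \log(2\pi e) + \frac{1}{2} \log\left(\frac{N_{21}}{N_{22}} + \frac{b' P_1}{N_{22}} + \frac{P_2}{P_2 + N_{22}}\right) \\ & + \mu_2 f_h\left(P_1, 1, N_{21}, b', \frac{1}{\mu_2}\right) + f_h(P_2, N_{22}, 1, a, \mu_1) \end{aligned} \tag{158}$$

subject to:

$$\begin{aligned} & \mu_1 + \mu_2 = \mu \\ & b' g_2 = b \\ & b' \le N_{21} \\ & a N_{22} \le 1 \\ & (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} = 1 \\ & 0 \le [\mu_1, \mu_2, b', g_2, N_{22}, N_{21}]. \end{aligned}$$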

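In fact, the objective of the above minimization problem is an upper bound on the support function of a channel within Class A1 which is obtained in Lemma 3. The constraints are the combination of (78) and (79), which are applied to guarantee the admissibility of the channel and to validate the upper bound obtained in Lemma 3. Hence, $\sigma_{\mathscr{C}}(\mu, 1) \le W_1(\mu)$. By using a new variable $S = (1 - \sqrt{g_2})^2 N_{22}$, we obtain

$$\begin{aligned} W_1(\mu) = \min\ & \frac{\mu_1}{2} \log\left[2\pi e (P_1 + aP_2 + 1)\right] - \frac{\mu_2}{2} \log(2\pi e) + \frac{1}{2} \log\left[(1 - \sqrt{g_2})^2 \left(\frac{1 - S + bP_1}{g_2 S} + \frac{P_2}{(1 - \sqrt{g_2})^2 P_2 + S}\right)\right] \\ & + \mu_2 f_h\left(P_1, 1, \frac{1 - S}{g_2}, \frac{b}{g_2}, \frac{1}{\mu_2}\right) + f_h\left(P_2, \frac{S}{(1 - \sqrt{g_2})^2}, 1, a, \mu_1\right) \end{aligned} \tag{159}$$

subject to:

$$\begin{aligned} & \mu_1 + \mu_2 = \mu \\ & S \le 1 - b \\ & S \le \frac{(1 - \sqrt{g_2})^2}{a} \\ & 0 \le [\mu_1, \mu_2, S, g_2]. \end{aligned}$$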

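The second optimization problem is written as

$$\begin{aligned} W_2(\mu) = \min\ & \mu \gamma\left(\frac{P_1}{N_{11}} + \frac{P_1}{a' P_2 + N_{12}}\right) + \gamma\left(\frac{P_2}{N_{22}} + \frac{P_2}{b' P_1 + N_{21}}\right) \\ & + f_h(P_2, N_{22}, N_{12}, a', \mu) + \frac{\mu}{2} \log\left((2\pi e)(a' P_2 + N_{12})\right) - \frac{1}{2} \log\left((2\pi e)(P_2 + N_{22})\right) \end{aligned} \tag{160}$$

subject to:

$$\begin{aligned} & a' g_1 = a \\ & b' g_2 = b \\ & b' N_{11} \le N_{21} \\ & a' N_{22} \le N_{12} \\ & (1 - \sqrt{g_1})^2 N_{11} + g_1 N_{12} = 1 \\ & (1 - \sqrt{g_2})^2 N_{22} + g_2 N_{21} = 1 \\ & 0 \le [a', b', g_1, g_2, N_{11}, N_{12}, N_{22}, N_{21}]. \end{aligned}$$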

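For this problem, Class B channels are used. In fact, the objective is the upper bound on the support function of channels within the class obtained in Lemma 5, and the constraints are defined to obtain the closed form formula for the upper bound and to confirm that the channels are admissible. Hence, we deduce $\sigma_{\mathscr{C}}(\mu, 1) \le W_2(\mu)$. By using new variables $S_1 = g_1 N_{12}$ and $S_2 = g_2 N_{21}$, we obtain

$$\begin{aligned} W_2(\mu) = \min\ & \mu \gamma\left(\frac{(1 - \sqrt{g_1})^2 P_1}{1 - S_1} + \frac{g_1 P_1}{aP_2 + S_1}\right) + \gamma\left(\frac{(1 - \sqrt{g_2})^2 P_2}{1 - S_2} + \frac{g_2 P_2}{bP_1 + S_2}\right) \\ & + f_h\left(P_2, \frac{1 - S_2}{(1 - \sqrt{g_2})^2}, \frac{S_1}{g_1}, \frac{a}{g_1}, \mu\right) + \frac{\mu}{2} \log\left((2\pi e) \frac{aP_2 + S_1}{g_1}\right) - \frac{1}{2} \log\left((2\pi e)\left(P_2 + \frac{1 - S_2}{(1 - \sqrt{g_2})^2}\right)\right) \end{aligned} \tag{161}$$

subject to:

$$\begin{aligned} & \frac{b(1 - S_1)}{(1 - \sqrt{g_1})^2} \le S_2 < 1 \\ & \frac{a(1 - S_2)}{(1 - \sqrt{g_2})^2} \le S_1 < 1 \\ & 0 < [g_1, g_2]. \end{aligned}$$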

In a similar fashion, one can introduce two other optimization problems, say $\tilde{W}_1(\mu)$ and $\tilde{W}_2(\mu)$, to obtain upper bounds on $\sigma_{\mathscr{C}}(1, \mu)$ by using the upper bounds on the support functions of channels in Class A2 and Class B.

Theorem 6 (New Outer Bound): For any rate pair $(R_1, R_2)$ achievable for the two-user weak Gaussian IC, the inequalities

$$\mu_1 R_1 + R_2 \le W(\mu_1) = \min\{W_1(\mu_1), W_2(\mu_1)\}, \tag{162}$$
$$R_1 + \mu_2 R_2 \le \tilde{W}(\mu_2) = \min\{\tilde{W}_1(\mu_2), \tilde{W}_2(\mu_2)\}, \tag{163}$$

hold for all $1 \le \mu_1, \mu_2$.

To obtain an upper bound on the sum rate, we can apply the following inequality:

$$\mathscr{C}_{sum} \le \min_{1 \le \mu_1, \mu_2} \frac{(\mu_2 - 1) W(\mu_1) + (\mu_1 - 1) \tilde{W}(\mu_2)}{\mu_1 \mu_2 - 1}. \tag{164}$$
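Inequality (164) follows by weighting (162) with $\mu_2 - 1$ and (163) with $\mu_1 - 1$: the weighted sum of the left hand sides is $(\mu_1 \mu_2 - 1)(R_1 + R_2)$. The sketch below illustrates this combination in Python; the function name is hypothetical, and the weighted-rate values fed to it are placeholders rather than evaluations of $W$ or $\tilde{W}$.

```python
def sum_rate_bound(mu1, mu2, W_mu1, W_tilde_mu2):
    """Combine bounds on mu1*R1 + R2 and R1 + mu2*R2 into a bound on R1 + R2.

    Requires mu1 >= 1, mu2 >= 1 and mu1 * mu2 > 1 so that the weights are
    nonnegative and the denominator is positive.
    """
    if mu1 < 1 or mu2 < 1 or mu1 * mu2 <= 1:
        raise ValueError("need mu1 >= 1, mu2 >= 1 and mu1 * mu2 > 1")
    return ((mu2 - 1) * W_mu1 + (mu1 - 1) * W_tilde_mu2) / (mu1 * mu2 - 1)
```

If the two inputs are tight for some rate pair, that is, $W(\mu_1) = \mu_1 R_1 + R_2$ and $\tilde{W}(\mu_2) = R_1 + \mu_2 R_2$, the function returns exactly $R_1 + R_2$.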
Fig. 9. $\mathscr{D}_0$ for the weak Gaussian IC. $r_1$, $r_2$, $r_3$, and $r_4$ are extreme points of $\mathscr{D}_0$ in the interior of the first quadrant.



C. Han-Kobayashi Achievable region 

In this sub-section, we aim at characterizing $\mathscr{G}$ for the weak Gaussian IC. To this end, we first investigate some properties of $\mathscr{D}_0(P_1, P_2, \alpha, \beta)$. First of all, we show that none of the inequalities describing $\mathscr{D}_0$ is redundant. In Figure 9, all possible extreme points are shown. It is easy to prove that $r'_i \notin \mathscr{D}_0$ for $i \in \{1, 2, \ldots, 6\}$. For instance, we consider $r'_6 = \left(\frac{2\psi_4 - \psi_5}{3}, \frac{2\psi_5 - \psi_4}{3}\right)$. Since $\psi_{31} + \psi_{32} + \psi_{33} = \psi_4 + \psi_5$ (see Section II.C), we have

$$\psi_3 = \min\{\psi_{31}, \psi_{32}, \psi_{33}\} \le \frac{1}{3}(\psi_{31} + \psi_{32} + \psi_{33}) = \frac{1}{3}(\psi_4 + \psi_5).$$

However, $\frac{1}{3}(\psi_4 + \psi_5)$ is the sum of the components of $r'_6$. Therefore, $r'_6$ violates the sum-rate inequality in the definition of the HK achievable region. Hence, $r'_6 \notin \mathscr{D}_0$. As another example, let us consider the point $(\psi_1, \psi_3 - \psi_1)$. We claim that this point violates (8). To this end, we need to show that $\psi_4 < \psi_3 + \psi_1$. However, it is easy to see that $\psi_4 < \psi_{31} + \psi_1$, $\psi_4 < \psi_{32} + \psi_1$, and $\psi_4 < \psi_{33} + \psi_1$ reduce to $0 < (1 - \alpha)(1 - b + \beta(1 - ab)P_2)$, $0 < (1 - \beta)(1 - a + (1 - ab)P_1)$, and $0 < (1 - \alpha)(1 - \beta)aP_2$, respectively. Therefore, this point does not belong to $\mathscr{D}_0$.

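We conclude that $\mathscr{D}_0$ has four extreme points in the interior of the first quadrant, namely

$$r_1 = (\psi_1, \psi_4 - 2\psi_1), \tag{165}$$
$$r_2 = (\psi_4 - \psi_3, 2\psi_3 - \psi_4), \tag{166}$$
$$r_3 = (2\psi_3 - \psi_5, \psi_5 - \psi_3), \tag{167}$$
$$r_4 = (\psi_5 - 2\psi_2, \psi_2). \tag{168}$$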


Most importantly, $\mathscr{D}_0$ possesses the unique minimizer property. To prove this, we need to show that $y$, the minimizer of the optimization problem

$$\begin{aligned} \sigma_{D_0}(c_1, c_2, P_1, P_2, \alpha, \beta) &= \max\{c_1 R_1 + c_2 R_2 \mid A R \le \Psi(P_1, P_2, \alpha, \beta)\} \\ &= \min\{y^t \Psi(P_1, P_2, \alpha, \beta) \mid A^t y = (c_1, c_2)^t,\ y \ge 0\}, \end{aligned} \tag{169}$$

is independent of the parameters $P_1$, $P_2$, $\alpha$, and $\beta$ and only depends on $c_1$ and $c_2$. We first consider the case $(c_1, c_2) = (\mu, 1)$ for all $1 \le \mu$. It can be shown that for $2 \le \mu$, the maximum of (169) is attained at $r_1$ regardless of $P_1$, $P_2$, $\alpha$, and $\beta$. Therefore, the dual program has the minimizer $y = (\mu - 2, 0, 0, 1, 0)^t$, which is clearly independent of $P_1$, $P_2$, $\alpha$, and $\beta$. In this case, we have

$$\sigma_{D_0}(\mu, 1, P_1, P_2, \alpha, \beta) = (\mu - 2)\psi_1 + \psi_4, \quad 2 \le \mu. \tag{170}$$

For $1 \le \mu \le 2$, one can show that $r_2$ and $y = (0, 0, 2 - \mu, \mu - 1, 0)^t$ are the maximizer and the minimizer of (169), respectively. In this case, we have

$$\sigma_{D_0}(\mu, 1, P_1, P_2, \alpha, \beta) = (2 - \mu)\psi_3 + (\mu - 1)\psi_4, \quad 1 \le \mu \le 2. \tag{171}$$

Next, we consider the case $(c_1, c_2) = (1, \mu)$ for all $1 \le \mu$. Again, it can be shown that for $2 \le \mu$ and $1 \le \mu \le 2$, $y = (0, \mu - 2, 0, 0, 1)^t$ and $y = (0, 0, 2 - \mu, 0, \mu - 1)^t$ minimize (169), respectively. Hence, we have

$$\sigma_{D_0}(1, \mu, P_1, P_2, \alpha, \beta) = (\mu - 2)\psi_2 + \psi_5, \quad \text{if } 2 \le \mu, \tag{172}$$
$$\sigma_{D_0}(1, \mu, P_1, P_2, \alpha, \beta) = (2 - \mu)\psi_3 + (\mu - 1)\psi_5, \quad \text{if } 1 \le \mu \le 2. \tag{173}$$

We conclude that the solutions of the dual program are always independent of $P_1$, $P_2$, $\alpha$, and $\beta$. Hence, $\mathscr{D}_0$ possesses the unique minimizer property.
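The dual solutions quoted above can be verified mechanically. The Python sketch below is a hedged illustration: it assumes the rows of $A$ correspond, in order, to the constraints on $R_1$, $R_2$, $R_1 + R_2$, $2R_1 + R_2$, and $R_1 + 2R_2$ with right hand sides $\psi_1, \ldots, \psi_5$, and the numerical values of $\psi$ used in any check are made-up placeholders, not values derived from channel parameters. It confirms that each claimed $y$ is dual feasible and that the dual value matches the best of the four extreme points (165)-(168).

```python
# Assumed constraint matrix: rows bound R1, R2, R1+R2, 2R1+R2, R1+2R2.
A = [(1, 0), (0, 1), (1, 1), (2, 1), (1, 2)]

def dual_minimizer(mu):
    """Claimed dual solution for (c1, c2) = (mu, 1) with mu >= 1."""
    if mu >= 2:
        return (mu - 2, 0, 0, 1, 0)
    return (0, 0, 2 - mu, mu - 1, 0)

def transpose_times(y):
    """Compute A^t y, which must equal (c1, c2) for dual feasibility."""
    return (sum(yi * row[0] for yi, row in zip(y, A)),
            sum(yi * row[1] for yi, row in zip(y, A)))

def extreme_points(psi):
    """The four interior extreme points r1..r4."""
    p1, p2, p3, p4, p5 = psi
    return [(p1, p4 - 2 * p1), (p4 - p3, 2 * p3 - p4),
            (2 * p3 - p5, p5 - p3), (p5 - 2 * p2, p2)]

def primal_value(mu, psi):
    """Largest mu*R1 + R2 over the four extreme points."""
    return max(mu * r1 + r2 for r1, r2 in extreme_points(psi))

def dual_value(mu, psi):
    """y^t psi for the claimed dual minimizer."""
    return sum(yi * p for yi, p in zip(dual_minimizer(mu), psi))
```

Because the dual vector depends only on $\mu$, the same `dual_minimizer` applies for every choice of $\psi$, which is the content of the unique minimizer property.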

Theorem 7: For the two-user weak Gaussian IC, time-sharing and concavification result in the same region. In other words, 
can be fully characterized by using TD/FD and allocating power over three different dimensions. 
Proof: Since % possesses the unique minimizer property, from Theorem [T] we deduce that = ^^2- Moreover, using 

Theorem [3] the number of frequency bands is at most three. □ 
To obtain the support function of $\mathscr{G}$, we need to obtain $g(c_1, c_2, P_1, P_2, \alpha, \beta)$ defined in (43). Since $\mathscr{D}_0$ possesses the unique minimizer property, (43) can be simplified. Let us consider the case where $(c_1, c_2) = (\mu, 1)$ for $\mu \ge 2$. It can be shown that for this case

$$g = \max_{(\alpha, \beta) \in [0, 1]^2} (\mu - 2)\psi_1(P_1, P_2, \alpha, \beta) + \psi_4(P_1, P_2, \alpha, \beta). \tag{174}$$

Substituting into (42), we obtain

$$\sigma_{\mathscr{G}}(\mu, 1, P_1, P_2) = \max \sum_{i=1}^{3} \lambda_i \left[(\mu - 2)\psi_1(P_{1i}, P_{2i}, \alpha_i, \beta_i) + \psi_4(P_{1i}, P_{2i}, \alpha_i, \beta_i)\right] \tag{175}$$

subject to:

$$\begin{aligned} & \sum_{i=1}^{3} \lambda_i = 1 \\ & \sum_{i=1}^{3} \lambda_i P_{1i} \le P_1 \\ & \sum_{i=1}^{3} \lambda_i P_{2i} \le P_2 \\ & 0 \le \lambda_i,\ 0 \le P_{1i},\ 0 \le P_{2i}, \quad \forall i \in \{1, 2, 3\} \\ & 0 \le \alpha_i \le 1,\ 0 \le \beta_i \le 1, \quad \forall i \in \{1, 2, 3\}. \end{aligned}$$

For other ranges of $(c_1, c_2)$, a similar optimization problem can be formed. It is worth noting that even though the number of parameters in characterizing $\mathscr{G}$ is reduced, it is still prohibitively difficult to characterize boundary points of $\mathscr{G}$. In Figures 10 and 11, different bounds for the symmetric weak Gaussian IC are plotted. As shown in these figures, the new outer bound is tighter than the previously known bounds.

V. One-sided Gaussian Interference Channels 

Fig. 11. Comparison between different bounds for the symmetric weak Gaussian IC when P = 100 and a = 0.1.

Throughout this section, we consider the one-sided Gaussian IC obtained by setting $b = 0$, i.e., the second receiver incurs no interference from the first transmitter. One can further split the class of one-sided ICs into two subclasses: the strong one-sided IC and the weak one-sided IC. For the former, $a \ge 1$ and the capacity region is fully characterized [16]. In this case, the capacity region is the union of all rate pairs $(R_1, R_2)$ satisfying



For the latter, $a < 1$ and the full characterization of the capacity region is still an open problem. Therefore, we always assume $a < 1$. Three important results are proved for this channel. The first one, proved by Costa in [11], states that the capacity region of the weak one-sided IC is equivalent to that of the degraded IC with an appropriate change of parameters. The second one, proved by Sato in [10], states that the capacity region of the degraded Gaussian IC is outer bounded by the capacity region of a certain degraded broadcast channel. The third one, proved by Sason in [16], characterizes the sum capacity by combining Costa's and Sato's results.

In this section, we provide an alternative proof for the outer bound obtained by Sato. We then characterize the full HK achievable region where Gaussian codebooks are used, i.e., $\mathscr{G}$.

A. Sum Capacity 

For the sake of completeness, we first state the sum capacity result obtained by Sason. 

Theorem 8 (Sason): The rate pair $\left( \gamma\left( \frac{P_1}{1 + aP_2} \right), \gamma(P_2) \right)$ is an extreme point of the capacity region of the one-sided Gaussian IC. Moreover, the sum capacity of the channel is attained at this point.
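As an illustration of Theorem 8, the following minimal sketch evaluates the sum-capacity rate pair numerically. It assumes $\gamma(x) = \frac{1}{2}\log_2(1 + x)$ (the logarithm base is an assumption made here), and the function names are hypothetical. The parameters are those quoted in the caption of Fig. 12.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x); the base-2 logarithm is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def one_sided_sum_capacity_point(p1: float, p2: float, a: float):
    """Rate pair of Theorem 8 for the weak one-sided Gaussian IC (a < 1).

    User 2 sees no interference and transmits at gamma(P2); Receiver 1
    treats the interference of power a * P2 as noise.
    """
    r1 = gamma(p1 / (1.0 + a * p2))
    r2 = gamma(p2)
    return r1, r2


if __name__ == "__main__":
    r1, r2 = one_sided_sum_capacity_point(p1=1.0, p2=7.0, a=0.4)
    print("R1 =", r1, "R2 =", r2, "sum rate =", r1 + r2)
```

The sum of the two returned rates is the sum capacity stated in the theorem.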

B. Outer Bound 

In [10], Sato derived an outer bound on the capacity of the degraded IC. This outer bound can be used for the weak one-sided 
IC as well. This is due to Costa's result which states that the capacity region of the degraded Gaussian IC is equivalent to that 
of the weak one-sided IC with an appropriate change of parameters. 

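Theorem 9 (Sato): If the rate pair $(R_1, R_2)$ belongs to the capacity region of the weak one-sided IC, then it satisfies

$$R_1 \le \gamma\left( \frac{(1 - \beta)P}{\beta P + 1/a} \right),$$
$$R_2 \le \gamma(\beta P),$$

for all $\beta \in [0, 1]$, where $P = P_1/a + P_2$.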

Proof: Since the sum capacity is attained at the point where User 2 transmits at its maximum rate $R_2 = \gamma(P_2)$, other boundary points of the capacity region can be obtained by characterizing the solutions of $\sigma_{\mathscr{C}}(\mu, 1) = \max\{\mu R_1 + R_2 \mid (R_1, R_2) \in \mathscr{C}\}$ for all $1 \le \mu$. Using Fano's inequality, we have

$$\begin{aligned}
n(\mu R_1 + R_2) &\le \mu I(x_1^n; y_1^n) + I(x_2^n; y_2^n) + n\epsilon_n \\
&= \mu h(y_1^n) - \mu h(y_1^n \mid x_1^n) + h(y_2^n) - h(y_2^n \mid x_2^n) + n\epsilon_n \\
&= \left[ \mu h(x_1^n + \sqrt{a}\,x_2^n + z_1^n) - h(z_2^n) \right] + \left[ h(x_2^n + z_2^n) - \mu h(\sqrt{a}\,x_2^n + z_1^n) \right] + n\epsilon_n \\
&\overset{(a)}{\le} \frac{\mu n}{2} \log\left[ 2\pi e (P_1 + aP_2 + 1) \right] - \frac{n}{2} \log(2\pi e) + \left[ h(x_2^n + z_2^n) - \mu h(\sqrt{a}\,x_2^n + z_1^n) \right] + n\epsilon_n \\
&\overset{(b)}{\le} \frac{\mu n}{2} \log\left[ 2\pi e (P_1 + aP_2 + 1) \right] - \frac{n}{2} \log(2\pi e) + n f_h(P_2, 1, 1, a, \mu) + n\epsilon_n,
\end{aligned}$$

where (a) follows from the fact that the Gaussian distribution maximizes the differential entropy for a given constraint on the covariance matrix and (b) follows from the definition of $f_h$ in (59).
Depending on the value of $\mu$, we consider the following two cases:

1- For $1 \le \mu \le \frac{P_2 + 1/a}{P_2 + 1}$, we have

$$\mu R_1 + R_2 \le \mu \gamma\left( \frac{P_1}{1 + aP_2} \right) + \gamma(P_2). \tag{177}$$

In fact, the point $\left( \gamma\left( \frac{P_1}{1 + aP_2} \right), \gamma(P_2) \right)$, which is achievable by treating interference as noise at Receiver 1, satisfies (177) with equality. Therefore, it belongs to the capacity region. Moreover, by setting $\mu = 1$, we deduce that this point corresponds to the sum capacity of the one-sided Gaussian IC. This is in fact an alternative proof for Sason's result.

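2- For $\frac{P_2 + 1/a}{P_2 + 1} \le \mu \le \frac{1}{a}$, we have

$$\mu R_1 + R_2 \le \frac{\mu}{2} \log(P_1 + aP_2 + 1) + \frac{1}{2} \log\left( \frac{1/a - 1}{\mu - 1} \right) - \frac{\mu}{2} \log\left( \frac{a\mu(1/a - 1)}{\mu - 1} \right). \tag{178}$$

Equivalently, we have

$$\mu R_1 + R_2 \le \frac{\mu}{2} \log\left( \frac{(aP + 1)(\mu - 1)}{\mu(1 - a)} \right) + \frac{1}{2} \log\left( \frac{1/a - 1}{\mu - 1} \right), \tag{179}$$

where $P = P_1/a + P_2$. Let us define $E_1$ as the set of all rate pairs $(R_1, R_2)$ satisfying (179), i.e.,

$$E_1 = \left\{ (R_1, R_2) \,\middle|\, \mu R_1 + R_2 \le \frac{\mu}{2} \log\left( \frac{(aP + 1)(\mu - 1)}{\mu(1 - a)} \right) + \frac{1}{2} \log\left( \frac{1/a - 1}{\mu - 1} \right), \ \forall\, \frac{P_2 + 1/a}{P_2 + 1} \le \mu \le \frac{1}{a} \right\}. \tag{180}$$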

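We claim that $E_1$ is the dual representation of the region defined in the statement of the theorem, see (4). To this end, we define $E_2$ as

$$E_2 = \left\{ (R_1, R_2) \,\middle|\, R_1 \le \gamma\left( \frac{(1 - \beta)P}{\beta P + 1/a} \right),\ R_2 \le \gamma(\beta P), \ \forall \beta \in [0, 1] \right\}. \tag{181}$$

We evaluate the support function of $E_2$ as

$$\sigma_{E_2}(\mu, 1) = \max\{ \mu R_1 + R_2 \mid (R_1, R_2) \in E_2 \}. \tag{182}$$

It is easy to show that $\beta = \frac{1/a - \mu}{P(\mu - 1)}$ maximizes the above optimization problem. Therefore, we have

$$\sigma_{E_2}(\mu, 1) = \frac{\mu}{2} \log\left( \frac{(aP + 1)(\mu - 1)}{\mu(1 - a)} \right) + \frac{1}{2} \log\left( \frac{1/a - 1}{\mu - 1} \right). \tag{183}$$

Since $E_2$ is a closed convex set, its dual representation can be obtained from its support function, and it is indeed equivalent to (180). This completes the proof. □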
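The maximization in (182) can be checked numerically. The sketch below is only an illustration under stated assumptions: $\gamma(x) = \frac{1}{2}\log_2(1 + x)$ with base-2 logarithms, the region $E_2$ read as in (181), and hypothetical function names. It compares a brute-force search over $\beta$ with the stationary point $\beta = \frac{1/a - \mu}{P(\mu - 1)}$ and with the closed form (183).

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x); the base-2 logarithm is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def sato_weighted_rate(beta, mu, p1, p2, a):
    """mu * R1 + R2 at the corner of the rectangle indexed by beta in (181)."""
    p = p1 / a + p2
    r1 = gamma((1.0 - beta) * p / (beta * p + 1.0 / a))
    r2 = gamma(beta * p)
    return mu * r1 + r2


def maximizing_beta(mu, p1, p2, a):
    """Stationary point beta = (1/a - mu) / (P (mu - 1))."""
    p = p1 / a + p2
    return (1.0 / a - mu) / (p * (mu - 1.0))


def support_closed_form(mu, p1, p2, a):
    """Closed form (183), valid for (P2 + 1/a)/(P2 + 1) <= mu <= 1/a."""
    p = p1 / a + p2
    first = 0.5 * mu * math.log2((a * p + 1.0) * (mu - 1.0) / (mu * (1.0 - a)))
    second = 0.5 * math.log2((1.0 / a - 1.0) / (mu - 1.0))
    return first + second


if __name__ == "__main__":
    mu, p1, p2, a = 1.5, 1.0, 7.0, 0.4   # mu lies between 1.1875 and 2.5
    grid_max = max(sato_weighted_rate(k / 10000, mu, p1, p2, a)
                   for k in range(10001))
    print("grid search :", grid_max)
    print("closed form :", support_closed_form(mu, p1, p2, a))
    print("beta*       :", maximizing_beta(mu, p1, p2, a))
```

The grid search is deliberately naive; it is meant only to confirm that the stationary point is a maximizer for the chosen parameters.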

C. Han-Kobayashi Achievable Region 

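In this subsection, we characterize $\mathscr{G}_1$, $\mathscr{G}_2$, and $\mathscr{G}$ for the weak one-sided Gaussian IC. $\mathscr{G}_0$ can be characterized as follows. Since there is no link between Transmitter 1 and Receiver 2, User 1's message in the HK achievable region is only the private message, i.e., $\alpha = 1$. In this case, we have

$$\psi_1 = \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{184}$$
$$\psi_2 = \gamma(P_2), \tag{185}$$
$$\psi_{31} = \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2), \tag{186}$$
$$\psi_{32} = \gamma\left( \frac{P_1}{1 + a\beta P_2} \right) + \gamma(P_2), \tag{187}$$
$$\psi_{33} = \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2), \tag{188}$$
$$\psi_4 = \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma\left( \frac{P_1}{1 + a\beta P_2} \right) + \gamma(\beta P_2), \tag{189}$$
$$\psi_5 = \gamma(\beta P_2) + \gamma(P_2) + \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right). \tag{190}$$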

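It is easy to show that $\psi_3 = \min\{\psi_{31}, \psi_{32}, \psi_{33}\} = \psi_{31}$, $\psi_{31} + \psi_1 = \psi_4$, and $\psi_{31} + \psi_2 = \psi_5$. Hence, $\mathscr{G}_0$ can be represented as all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{191}$$
$$R_2 \le \gamma(P_2), \tag{192}$$
$$R_1 + R_2 \le \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2). \tag{193}$$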
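The three identities that collapse the HK region to a pentagon can be sanity-checked numerically. The following is a minimal sketch, assuming $\gamma(x) = \frac{1}{2}\log_2(1 + x)$ and the $\psi$ terms as read from (184) to (190) with $\alpha = 1$ and $b = 0$; the function names are hypothetical.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x); the base-2 logarithm is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def one_sided_psis(p1, p2, a, beta):
    """psi terms for the weak one-sided IC (alpha = 1, b = 0)."""
    den = 1.0 + a * beta * p2
    common = gamma((p1 + a * (1.0 - beta) * p2) / den)
    return {
        "psi1": gamma(p1 / den),
        "psi2": gamma(p2),
        "psi31": common + gamma(beta * p2),
        "psi32": gamma(p1 / den) + gamma(p2),
        "psi33": common + gamma(beta * p2),
        "psi4": common + gamma(p1 / den) + gamma(beta * p2),
        "psi5": gamma(beta * p2) + gamma(p2) + common,
    }


def pentagon_identities_hold(p1, p2, a, steps=101, tol=1e-12):
    """Check psi3 = psi31, psi31 + psi1 = psi4, psi31 + psi2 = psi5 on a beta grid."""
    for k in range(steps):
        s = one_sided_psis(p1, p2, a, k / (steps - 1))
        psi3 = min(s["psi31"], s["psi32"], s["psi33"])
        if abs(psi3 - s["psi31"]) > tol:
            return False
        if abs(s["psi31"] + s["psi1"] - s["psi4"]) > tol:
            return False
        if abs(s["psi31"] + s["psi2"] - s["psi5"]) > tol:
            return False
    return True


if __name__ == "__main__":
    print(pentagon_identities_hold(p1=1.0, p2=7.0, a=0.4))
```

Only the first identity is non-trivial; it relies on $a < 1$, since $\psi_{31} \le \psi_{32}$ reduces to $(1 - \beta)P_2(a - 1 - P_1) \le 0$.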

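We claim that $\mathscr{G}_2 = \mathscr{G}$. To prove this, we need to show that $\mathscr{G}_0$ possesses the unique minimizer property. $\mathscr{G}_0$ is a pentagon with two extreme points in the interior of the first quadrant, namely $r_1$ and $r_2$, where

$$r_1 = \left( \gamma\left( \frac{P_1}{1 + a\beta P_2} \right),\ \gamma(\beta P_2) + \gamma\left( \frac{(1 - \beta)aP_2}{1 + P_1 + \beta aP_2} \right) \right), \tag{194}$$
$$r_2 = \left( \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2) - \gamma(P_2),\ \gamma(P_2) \right). \tag{195}$$

Using the above, it can be verified that $\mathscr{G}_0$ possesses the unique minimizer property.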

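Next, we can use the optimization problem in (42) to obtain the support function of $\mathscr{G}$. However, we only need to consider $(c_1, c_2) = (\mu, 1)$ for $\mu > 1$. Therefore, we have

$$g(\mu, 1, P_1, P_2, \beta) = \max_{0 \le \beta \le 1} \; \mu \gamma\left( \frac{P_1}{1 + \beta aP_2} \right) + \gamma(\beta P_2) + \gamma\left( \frac{(1 - \beta)aP_2}{1 + P_1 + \beta aP_2} \right). \tag{196}$$

Substituting into (42), we conclude that boundary points of $\mathscr{G}$ can be characterized by solving the following optimization problem:

$$W = \max \sum_{i=1}^{3} \lambda_i \left[ \mu \gamma\left( \frac{P_{1i}}{1 + \beta_i aP_{2i}} \right) + \gamma(\beta_i P_{2i}) + \gamma\left( \frac{(1 - \beta_i)aP_{2i}}{1 + P_{1i} + \beta_i aP_{2i}} \right) \right] \tag{197}$$

subject to:

$$\sum_{i=1}^{3} \lambda_i = 1,$$
$$\sum_{i=1}^{3} \lambda_i P_{1i} \le P_1,$$
$$\sum_{i=1}^{3} \lambda_i P_{2i} \le P_2,$$
$$0 \le \beta_i \le 1, \quad \forall i \in \{1, 2, 3\},$$
$$0 \le [P_{1i}, P_{2i}, \lambda_i], \quad \forall i \in \{1, 2, 3\}.$$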

For the sake of completeness, we provide a simple description for $\mathscr{G}_1$ in the next lemma.

Fig. 12. Comparison between different bounds for the one-sided Gaussian IC when $P_1 = 1$, $P_2 = 7$, and $a = 0.4$.



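Lemma 7: The region $\mathscr{G}_1$ can be represented as the collection of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta' P_2} \right), \tag{198}$$
$$R_2 \le \gamma(\beta' P_2) + \gamma\left( \frac{a(1 - \beta')P_2}{1 + P_1 + a\beta' P_2} \right), \tag{199}$$

for all $\beta' \in [0, 1]$. Moreover, $\mathscr{G}_1$ is convex and any point that lies on its boundary can be achieved by using superposition coding and successive decoding.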

Proof: Let $E$ denote the set defined in the above lemma. It is easy to show that $E$ is convex and $E \subseteq \mathscr{G}_1$. To prove the inverse inclusion, it suffices to show that the extreme points of $\mathscr{G}_0$, $r_1$ and $r_2$ (see (194) and (195)), are inside $E$ for all $\beta \in [0, 1]$. By setting $\beta' = \beta$, we see that $r_1 \in E$. To prove $r_2 \in E$, we set $\beta' = 1$. We conclude that $r_2 \in E$ if the following inequality holds

$$\gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2) - \gamma(P_2) \le \gamma\left( \frac{P_1}{1 + aP_2} \right) \tag{200}$$

for all $\beta \in [0, 1]$. However, (200) reduces to $0 \le (1 - a)(1 - \beta)P_2$, which holds for all $\beta \in [0, 1]$. Hence, $\mathscr{G}_1 \subseteq E$. Using these facts, it is straightforward to show that the boundary points are achievable by using superposition coding and successive decoding. □
Figure 12 compares different bounds for the one-sided Gaussian IC.
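The key inequality in the proof of Lemma 7 (the condition for $r_2 \in E$ with $\beta' = 1$) can be checked numerically. The sketch below is illustrative only: it assumes $\gamma(x) = \frac{1}{2}\log_2(1 + x)$, uses the parameters of Fig. 12, and the function name is hypothetical.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x); the base-2 logarithm is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def lemma7_gap(p1, p2, a, beta):
    """Right-hand side minus left-hand side of the inequality used for r2.

    LHS: first coordinate of r2 for a given beta.
    RHS: the bound on R1 in the lemma evaluated at beta' = 1.
    The gap is non-negative exactly when 0 <= (1 - a)(1 - beta) * P2.
    """
    lhs = (gamma((p1 + a * (1.0 - beta) * p2) / (1.0 + a * beta * p2))
           + gamma(beta * p2) - gamma(p2))
    rhs = gamma(p1 / (1.0 + a * p2))
    return rhs - lhs


if __name__ == "__main__":
    p1, p2, a = 1.0, 7.0, 0.4
    gaps = [lemma7_gap(p1, p2, a, k / 100) for k in range(101)]
    print("smallest gap over the grid:", min(gaps))
```

The gap vanishes at $\beta = 1$, where $r_2$ coincides with the sum-capacity point, and is strictly positive for $\beta < 1$ when $a < 1$.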



VI. Mixed Gaussian Interference Channels 

In this section, we focus on the mixed Gaussian interference channel. We first characterize the sum capacity of this channel. Then, we provide an outer bound on the capacity region. Finally, we investigate the HK achievable region. Without loss of generality, we assume $a < 1$ and $b > 1$.
A. Sum Capacity

Theorem 10: The sum capacity of the mixed Gaussian IC with $a < 1$ and $b > 1$ can be stated as

$$\mathscr{C}_{sum} = \gamma(P_2) + \min\left\{ \gamma\left( \frac{P_1}{1 + aP_2} \right), \gamma\left( \frac{bP_1}{1 + P_2} \right) \right\}. \tag{201}$$

Proof: We need to prove the achievability and converse for the theorem.
Achievability part: Transmitter 1 sends a common message to both receivers, while the second user's signal is considered as noise at both receivers. In this case, the rate

$$R_1 = \min\left\{ \gamma\left( \frac{P_1}{1 + aP_2} \right), \gamma\left( \frac{bP_1}{1 + P_2} \right) \right\} \tag{202}$$

is achievable. At Receiver 2, the signal from Transmitter 1 can be decoded and removed. Therefore, User 2 is left with a channel without interference and it can communicate at its maximum rate, which is

$$R_2 = \gamma(P_2). \tag{203}$$

By adding (202) and (203), we obtain the desired result.

Converse part: The sum capacity of the Gaussian IC is upper bounded by that of the two underlying one-sided Gaussian ICs. Hence, we can obtain two upper bounds on the sum rate. We first remove the interfering link between Transmitter 1 and Receiver 2. In this case, we have a one-sided Gaussian IC with weak interference. The sum capacity of this channel is known [16]. Hence, we have

$$\mathscr{C}_{sum} \le \gamma(P_2) + \gamma\left( \frac{P_1}{1 + aP_2} \right). \tag{204}$$

By removing the interfering link between Transmitter 2 and Receiver 1, we obtain a one-sided Gaussian IC with strong interference. The sum capacity of this channel is known. Hence, we have

$$\mathscr{C}_{sum} \le \gamma(bP_1 + P_2), \tag{205}$$

which equivalently can be written as

$$\mathscr{C}_{sum} \le \gamma(P_2) + \gamma\left( \frac{bP_1}{1 + P_2} \right). \tag{206}$$

By taking the minimum of the right-hand sides of (204) and (206), we obtain

$$\mathscr{C}_{sum} \le \gamma(P_2) + \min\left\{ \gamma\left( \frac{P_1}{1 + aP_2} \right), \gamma\left( \frac{bP_1}{1 + P_2} \right) \right\}. \tag{207}$$

This completes the proof. □
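The sum capacity expression of Theorem 10 is straightforward to evaluate. The sketch below assumes $\gamma(x) = \frac{1}{2}\log_2(1 + x)$ (the logarithm base is an assumption) and uses a hypothetical function name; the two terms inside the minimum correspond to the two one-sided upper bounds used in the converse.

```python
import math


def gamma(x: float) -> float:
    """gamma(x) = 0.5 * log2(1 + x); the base-2 logarithm is an assumption."""
    return 0.5 * math.log2(1.0 + x)


def mixed_sum_capacity(p1: float, p2: float, a: float, b: float) -> float:
    """Sum capacity of the mixed Gaussian IC (a < 1, b > 1)."""
    # Bound from removing the link Transmitter 1 -> Receiver 2 (weak one-sided IC).
    weak_term = gamma(p1 / (1.0 + a * p2))
    # Bound from removing the link Transmitter 2 -> Receiver 1 (strong one-sided IC).
    strong_term = gamma(b * p1 / (1.0 + p2))
    return gamma(p2) + min(weak_term, strong_term)


if __name__ == "__main__":
    # Parameters quoted in the captions of Figs. 14 and 15.
    print(mixed_sum_capacity(7.0, 7.0, 0.6, 2.0))
    print(mixed_sum_capacity(7.0, 7.0, 0.4, 1.5))
```

When the strong term is the smaller one, the result equals $\gamma(bP_1 + P_2)$, because $(1 + P_2)\left(1 + \frac{bP_1}{1 + P_2}\right) = 1 + P_2 + bP_1$.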
Remark 4: In an independent work [25], the sum capacity of the mixed Gaussian IC is obtained for a certain range of 
parameters, whereas in the above theorem, we characterize the sum capacity of this channel for the entire range of its parameters 
(see also [24]). 

By comparing $\gamma\left( \frac{P_1}{1 + aP_2} \right)$ with $\gamma\left( \frac{bP_1}{1 + P_2} \right)$, we observe that if $1 + P_2 \le b + abP_2$, then the sum capacity corresponds to the sum capacity of the one-sided weak Gaussian IC, whereas if $1 + P_2 > b + abP_2$, then the sum capacity corresponds to the sum capacity of the one-sided strong IC. Similar to the one-sided Gaussian IC, since the sum capacity is attained at the point where User 2 transmits at its maximum rate $R_2 = \gamma(P_2)$, other boundary points of the capacity region can be obtained by characterizing the solutions of $\sigma_{\mathscr{C}}(\mu, 1) = \max\{ \mu R_1 + R_2 \mid (R_1, R_2) \in \mathscr{C} \}$ for all $1 \le \mu$.

B. New Outer Bound 

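The best outer bound to date, due to Etkin et al. [13], is obtained by using the genie-aided technique. This bound is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma(P_1), \tag{208}$$
$$R_2 \le \gamma(P_2), \tag{209}$$

$$R_1 + R_2 \le \gamma(P_2 + bP_1), \tag{211}$$
$$2R_1 + R_2 \le \gamma(P_1 + aP_2) + \gamma\left( bP_1 + \frac{P_2}{1 + aP_2} \right) + \gamma\left( \frac{P_1}{1 + bP_1} \right). \tag{212}$$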
The capacity region of the mixed Gaussian IC is inside the intersection of the capacity regions of the two underlying one-sided Gaussian ICs. Removing the link between Transmitter 1 and Receiver 2 results in a weak one-sided Gaussian IC whose outer bound $E_1$ is the collection of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{(1 - \beta)P'}{\beta P' + 1/a} \right), \tag{213}$$
$$R_2 \le \gamma(\beta P'), \tag{214}$$

for all $\beta \in [0, \beta_{\max}]$, where $P' = P_1/a + P_2$ and $\beta_{\max} = \frac{P_2}{P'(1 + P_1)}$. On the other hand, removing the link between Transmitter 2 and Receiver 1 results in a strong one-sided Gaussian IC whose capacity region $E_2$ is fully characterized as the collection of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma(bP_1), \tag{215}$$
$$R_2 \le \gamma(P_2), \tag{216}$$
$$R_1 + R_2 \le \gamma(bP_1 + P_2). \tag{217}$$
Using the channels in Class C, we upper bound $\sigma_{\mathscr{C}}(\mu, 1)$ based on the following optimization problem:

W{fi) =min ^ log (27re(Pi + aPj + 1)) + ^ log (^27re [j^^^ + b'Pi + N^i^ ) (218) 

- i log(27reiV2i) - ^ log(27reiV22) + h{P2,N22, l,a,/x - 1) 
subject to: 

b'g2 = b 
b' > N21 
aN22 < 1 

(1 - V9^fN22 + ff2iV21 = 1 

0< [b',g2,N22,N2i]. 
By substituting S = 5(2-^21. we obtain 

WM .„6 + aF. + D) + i log (2„ (^^-^^^ + 5£l±^ )) (2,9, 

1, /2TreS\ 1 /27re(l-5)\ , /„ 1-5 

log ' ' ' /li-tID 



2 V 52 
subject to: 

S< 1 

a(l - 5) < (1 - 
0< [5,52]. 

Hence, we have the following theorem that provides an outer bound on the capacity region of the mixed Gaussian IC.
Theorem 11: For any rate pair $(R_1, R_2)$ achievable for the two-user mixed Gaussian IC, $(R_1, R_2) \in E_1 \cap E_2$. Moreover, the inequality

$$\mu R_1 + R_2 \le W(\mu) \tag{220}$$

holds for all $1 \le \mu$.



C. Han-Kobayashi Achievable Region

In this subsection, we study the HK achievable region for the mixed Gaussian IC. Since Receiver 2 can always decode the message of the first user, User 1 associates all its power to the common message. User 2, on the other hand, allocates $\beta P_2$ and $(1 - \beta)P_2$ of its total power to its private and common messages, respectively, where $\beta \in [0, 1]$. Therefore, we have

$$\psi_1 = \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{221}$$
$$\psi_2 = \gamma(P_2), \tag{222}$$
$$\psi_{31} = \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2), \tag{223}$$
$$\psi_{32} = \gamma(P_2 + bP_1), \tag{224}$$
$$\psi_{33} = \gamma\left( \frac{a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2 + bP_1), \tag{225}$$
$$\psi_4 = \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2 + bP_1), \tag{226}$$
$$\psi_5 = \gamma(\beta P_2) + \gamma(P_2 + bP_1) + \gamma\left( \frac{a(1 - \beta)P_2}{1 + a\beta P_2} \right). \tag{227}$$

Fig. 13. The new region which is obtained by enlarging $\mathscr{G}_0$.

Due to the fact that the sum capacity is attained at the point where the second user transmits at its maximum rate, the 
last inequality in the description of the HK achievable region can be removed. Although the point rg = [ip^ ~ 7(^2), 7(Pi)) 
in Figure |9] may not be in this point is always achievable due to the sum capacity result. Hence, we can enlarge % by 
removing 7-3 and r^. Let us denote the resulting region as J^q. Moreover, one can show that r'2, r'^, r'^, and rg are still outside 
^^Q. However, for the mixed Gaussian IC, it is possible that r'^ belongs to J#q. In Figure [13] two alternative cases for the region 
along with the new labeling of its extreme points are plotted. The new extreme points can be written as 

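$$r_1 = (\psi_1,\ \psi_4 - 2\psi_1),$$
$$r_2 = (\psi_1,\ \psi_3 - \psi_1),$$
$$r_3 = (\psi_4 - \psi_3,\ 2\psi_3 - \psi_4),$$
$$r_4 = (\psi_3 - \psi_2,\ \psi_2).$$

In fact, we have either $\mathscr{G}'_0 = \mathrm{conv}\{r_1, r_3, r_4\}$ or $\mathscr{G}'_0 = \mathrm{conv}\{r_2, r_4\}$.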
To simplify the characterization of $\mathscr{G}_1$, we consider three cases:

Case I: $1 + P_2 \le b + abP_2$.

Case II: $1 + P_2 > b + abP_2$ and $1 - a \le abP_1$.

Case III: $1 + P_2 > b + abP_2$ and $1 - a > abP_1$.
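The three cases can be told apart with a few comparisons. The helper below is a hypothetical sketch; placing the equality cases in Case I and Case II is an assumption, since the extraction does not distinguish strict from non-strict inequalities. The example parameters are those quoted in the captions of Figs. 14, 15, and 16.

```python
def mixed_ic_case(p1: float, p2: float, a: float, b: float) -> str:
    """Return "I", "II" or "III" for a mixed Gaussian IC with a < 1 and b > 1."""
    if 1.0 + p2 <= b + a * b * p2:
        return "I"
    if 1.0 - a <= a * b * p1:
        return "II"
    return "III"


if __name__ == "__main__":
    print(mixed_ic_case(7.0, 7.0, 0.6, 2.0))      # parameters of Fig. 14
    print(mixed_ic_case(7.0, 7.0, 0.4, 1.5))      # parameters of Fig. 15
    print(mixed_ic_case(7.0, 700.0, 0.01, 1.5))   # parameters of Fig. 16
```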

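Case I ($1 + P_2 \le b + abP_2$): In this case, $\psi_3 = \psi_{31}$. Moreover, it is easy to verify that $\psi_{31} + \psi_1 \le \psi_4$, which means the constraint $2R_1 + R_2 \le \psi_4$ is redundant for the entire range of parameters. Hence, $\mathscr{G}'_0 = \mathrm{conv}\{r_2, r_4\}$ consists of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{228}$$
$$R_2 \le \gamma(P_2), \tag{229}$$
$$R_1 + R_2 \le \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2), \tag{230}$$

where $\beta \in [0, 1]$. Using a reasoning similar to the one used to express boundary points of $\mathscr{G}_1$ for the one-sided Gaussian IC, we can express boundary points of $\mathscr{G}_1$ as

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{231}$$
$$R_2 \le \gamma(\beta P_2) + \gamma\left( \frac{a(1 - \beta)P_2}{1 + P_1 + a\beta P_2} \right), \tag{232}$$

for all $\beta \in [0, 1]$.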

Theorem 12: For the mixed Gaussian IC satisfying $1 \le ab$, region $\mathscr{G}$ is equivalent to that of the one-sided Gaussian IC obtained from removing the interfering link between Transmitter 1 and Receiver 2.

Proof: If $1 \le ab$, then $1 + P_2 \le b + abP_2$ holds for all $P_1$ and $P_2$. Hence, $\mathscr{G}'_0(P_1, P_2, \beta)$ is a pentagon defined by (228), (229), and (230). Comparing with the corresponding region for the one-sided Gaussian IC, we see that $\mathscr{G}'_0$ is equivalent to $\mathscr{G}_0$ obtained for the one-sided Gaussian IC. This directly implies that $\mathscr{G}$ is the same for both channels. □
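The first step of the proof, that $1 \le ab$ places the channel in Case I for every power pair, can be spelled out as a short worked inequality. It uses only $a < 1$, $1 \le ab$, and $P_2 \ge 0$:

```latex
\begin{align*}
ab \ge 1 \ \text{and}\ a < 1 \;&\Longrightarrow\; b \ge \tfrac{1}{a} > 1, \\
(b + abP_2) - (1 + P_2) \;&=\; (b - 1) + (ab - 1)P_2 \;\ge\; 0 .
\end{align*}
```

Both terms on the right are non-negative, so the Case I condition holds regardless of $P_1$ and $P_2$.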

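Case II ($1 + P_2 > b + abP_2$ and $1 - a \le abP_1$): In this case, $\psi_3 = \min\{\psi_{31}, \psi_{32}\}$. It can be shown that $\mathscr{G}_1$ is the union of three regions $E_1$, $E_2$, and $E_3$, i.e., $\mathscr{G}_1 = E_1 \cup E_2 \cup E_3$. Region $E_1$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{233}$$
$$R_2 \le \gamma(\beta P_2) + \gamma\left( \frac{a(1 - \beta)P_2}{1 + P_1 + a\beta P_2} \right), \tag{234}$$

for all $\beta \in \left[ 0, \frac{b - 1}{(1 - ab)P_2} \right]$. Region $E_2$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{bP_1}{1 + \beta P_2} \right), \tag{235}$$
$$R_2 \le \gamma\left( \frac{P_1 + a(1 - \beta)P_2}{1 + a\beta P_2} \right) + \gamma(\beta P_2) - \gamma\left( \frac{bP_1}{1 + \beta P_2} \right), \tag{236}$$

for all $\beta \in \left[ \frac{b - 1}{(1 - ab)P_2}, \frac{(b - 1)P_1 + (1 - a)P_2}{(1 - ab)P_1P_2 + (1 - a)P_2} \right]$. Region $E_3$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{bP_1\left( 1 - a + (1 - ab)P_1 \right)}{(1 - a)(1 + bP_1 + P_2)} \right), \tag{237}$$
$$R_2 \le \gamma(P_2), \tag{238}$$
$$R_1 + R_2 \le \gamma(bP_1 + P_2). \tag{239}$$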
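Case III ($1 + P_2 > b + abP_2$ and $1 - a > abP_1$): In this case, $\psi_3 = \min\{\psi_{31}, \psi_{32}\}$. Similar to Case II, we have $\mathscr{G}_1 = E_1 \cup E_2 \cup E_3$, where regions $E_1$, $E_2$, and $E_3$ are defined as follows. Region $E_1$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{240}$$
$$R_2 \le \gamma(\beta P_2) + \gamma\left( \frac{a(1 - \beta)P_2}{1 + P_1 + a\beta P_2} \right), \tag{241}$$

for all $\beta \in \left[ 0, \frac{b - 1}{(1 - ab)P_2} \right]$. Region $E_2$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{242}$$
$$R_2 \le \gamma\left( \frac{a(1 - \beta)P_2}{1 + P_1 + a\beta P_2} \right) + \gamma(\beta P_2 + bP_1) - \gamma\left( \frac{P_1}{1 + a\beta P_2} \right), \tag{243}$$

for all $\beta \in \left[ \frac{b - 1}{(1 - ab)P_2}, 1 \right]$. Region $E_3$ is the union of all rate pairs $(R_1, R_2)$ satisfying

$$R_1 \le \gamma\left( \frac{P_1}{1 + aP_2} \right), \tag{244}$$
$$R_2 \le \gamma(P_2), \tag{245}$$
$$R_1 + R_2 \le \gamma(bP_1 + P_2). \tag{246}$$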

Remark 5: Region $E_3$ in Case II and Case III represents a facet that belongs to the capacity region of the mixed Gaussian IC. It is important to note that, surprisingly, this facet is obtainable when the second transmitter uses both the common message and the private message.

Different bounds are compared for the mixed Gaussian IC for Cases I, II, and III in Figures 14, 15, and 16, respectively.
Fig. 14. Comparison between different bounds for the mixed Gaussian IC when $1 + P_2 \le b + abP_2$ (Case I) for $P_1 = 7$, $P_2 = 7$, $a = 0.6$, and $b = 2$.

Fig. 15. Comparison between different bounds for the mixed Gaussian IC when $1 + P_2 > b + abP_2$ and $1 - a \le abP_1$ (Case II) for $P_1 = 7$, $P_2 = 7$, $a = 0.4$, and $b = 1.5$.



VII. Conclusion 

We have studied the capacity region of the two-user Gaussian IC. The sum capacities, inner bounds, and outer bounds have 
been considered for three classes of channels: weak, one-sided, and mixed Gaussian IC. We have used admissible channels as 
the main tool for deriving outer bounds on the capacity regions. 

For the weak Gaussian IC, we have derived the sum capacity for a certain range of channel parameters. In this range, the 
sum capacity is attained when Gaussian codebooks are used and interference is treated as noise. Moreover, we have derived a 
new outer bound on the capacity region. This outer bound is tighter than Kramer's bound and the ETW bound. Regarding
inner bounds, we have reduced the computational complexity of the HK achievable region. In fact, we have shown that when 
Gaussian codebooks are used, the full HK achievable region can be obtained by using the naive HK achievable scheme over 
three frequency bands. 

For the one-sided Gaussian IC, we have presented an alternative proof for Sato's outer bound. We have also derived the
full HK achievable region when Gaussian codebooks are used. 

For the mixed Gaussian IC, we have derived the sum capacity for the entire range of its parameters. Moreover, we have 
presented a new outer bound on the capacity region that outperforms ETW's bound. We have proved that the full HK achievable 
region using Gaussian codebooks is equivalent to that of the one-sided Gaussian IC for a particular range of channel gains. 
Fig. 16. Comparison between different bounds for the mixed Gaussian IC when $1 + P_2 > b + abP_2$ and $1 - a > abP_1$ (Case III) for $P_1 = 7$, $P_2 = 700$, $a = 0.01$, and $b = 1.5$.

We have also derived a facet that belongs to the capacity region for a certain range of parameters. Surprisingly, this facet is 
obtainable when one of the transmitters uses both the common message and the private message. 